(54) CANCER-ASSOCIATED GENES

(57) A method for detecting cancer cells in an extirpated sample, which comprises monitoring changes in
the expression level of at least one cancer-associated gene selected from among genes whose cDNAs are
those hybridizable under stringent conditions with DNAs having the base sequences represented by SEQ ID
NOS: 1 to 16 in the Sequence Listing or nucleic acids having the base sequences represented by SEQ ID
NOS: 1 to 16 in the Sequence Listing; and a kit, etc., for detecting cancer by the above method.






EP 0 972 830 A1 

Description 

TECHNICAL FIELD



[0001] The present invention relates to a method for detecting a cancer cell characterized by detecting an expression
product of a gene capable of changing an expression level thereof owing to canceration. The present invention also
relates to a gene capable of changing an expression level thereof owing to canceration, and to a product of the gene.
[0002] Cancers constitute the top of the causes for mortality in Japan since 1981, and a gastric cancer occurs
especially [...]. Recently, there has been known that there is a multi-stage carcinogenic mechanism in [...]
[..., 258, 603 - 607 (1992)] for which the accumulation of the abnormality in a plurality of genes including DNA repair
genes, tumor suppressor genes and oncogenes is essential. Generally, the instability of a gene and the inactivation of
a tumor suppressor gene are involved in the development of a cancer, and the activation of an oncogene and/or the
overexpression of a growth factor are involved in the advancement and malignancy of a cancer.

[0003] The instability of a gene includes the instability of a gene associated with abnormality in a DNA mismatch
repair system and the instability at a chromosomal level. An example of the former includes the difference in the chain
length of a simple repeated sequence present in a genome between a cancer site and a non-cancer site in the same
individual (microsatellite instability) [Thibodeau, S.N. et al., Science, 260, 816 - 819 (1993)], and an example of the
latter includes an interchromosomal translocation. The interchromosomal translocation may cause expression of a
protein which is not found in normal cells, or may affect an expression level of a protein even if it is expressed in
normal cells. In fact, in human chronic myelocytic leukemia, the bcr gene is fused with the c-abl gene by the
interchromosomal translocation, and there has been confirmed an expression of a hybrid mRNA transcribed from the
bcr-abl fusion gene, which is absent in normal cells. Further, there has been confirmed that an introduction of the
bcr-abl fusion [...]

[0004] The inactivation of a tumor suppressor gene includes, for example, an inactivation of the p53 gene. The
inactivation [...] a deletion within the gene, or a point mutation in a particular portion of an encoding region
[Nigro, J.M. et al., Nature, 342, 705 - 708 (1989); Malkin, D. et al., Science, 250, 1233 - 1238 (1990)].
In addition, since the deletion and the point mutation of the p53 gene are observed in various cancers, and are as
frequent [...], especially in cases of a gastric cancer at an early stage [Yokozaki, H. et al., Journal of Cancer
Research and Clinical Oncology, 119, 67 - 70 (1992)], the detection of these mutations is considered to be useful for
detecting a cancer at an early stage.

[0005] On the other hand, the p16/MTS1 gene has been known to be a gene which is inactivated owing to a homologous
deletion, and high-frequency homologous deletions have been observed in cases of a glioma, a pancreatic cancer and
a urinary bladder cancer [Cairns, P. et al., Nature Genetics, 11, 210 - 212 (1995)]. The p16 protein regulates a cell
cycle, and the abnormality in p16 expression has been suggested to be involved in the canceration of a cell [Okamoto, A.
et al., Proceedings of the National Academy of Sciences of the United States of America, 91, 11045 - 11049 (1994)].
[0006] As the causation for the activation of an oncogene, there can be included, for example, a viral insertion
mutation in a proximity of an oncogene and an interchromosomal translocation. For example, a viral insertion mutation
has [...] of a chicken which is caused by an avian leukosis virus (ALV). In this case, it has been found that DNA of
an ALV is inserted in the proximity of a gene c-myc, and, by a potent viral enhancer and promoter, a normal c-myc is
overexpressed, and a new sequence which is different partially from the normal gene has been expressed. In addition,
in a certain kind of human B cell tumor, there has been confirmed that c-myc, which is one of oncogenes, is located
near a potent transcription signal of immunoglobulin by the interchromosomal translocation, thereby increasing the
expression level of its mRNA. In this case, no difference has been found between a protein for c-myc in a cancer cell
and a protein for c-myc expressed in a normal cell, and the canceration is considered to be [...] the expression level
of the c-myc mRNA [Watson, J.D. et al., Molecular Biology of Recombinant DNAs, 2nd Ed., Maruzen K.K., 305 - 308 (1992)].

[0007] An overexpression of a growth factor includes, for example, an overexpression of c-met which encodes a
hepatocyte growth factor receptor. There has been confirmed that the abnormality in expression of the c-met is
observed as an expression of mRNA having the length of 6.0 kb which is not found in a normal mucous membrane at
an early stage of gastric cancer [Kuniyasu, H. et al., International Journal of Cancer, 55, 72 - 75 (1993)], or is
observed [...] frequency, and a correlation between the gene amplification and the degree of the cancer malignancy is
observed [Kuniyasu, H. et al., Biochemical and Biophysical Research Communications, 189, 227 - 232 (1992)].
[0008] As examples of confirming the correlation between the gene abnormality and the degree of cancer malignancy
in addition to that of the c-met mentioned above, there have been confirmed that an amplification and/or an
overexpression of an oncogene c-erbB2 gene is found in mammary cancers, ovarian cancers, gastric cancers and uterine
cancers [Wright, C. et al., Cancer Research, 49, 2087 - 2090 (1989); Saffari, B. et al., Cancer Research, 55, 5693 - 5698
(1995)]; and that an amplification and/or an overexpression of an oncogene K-sam gene is found in a poorly-differentiated
adenocarcinoma which is one tissue type of gastric cancer [Tahara, E. et al., Gastric Cancer, Tokyo, Springer-Verlag,
Published in 1993, 209 - 217], respectively.

[0009] As described above, the information concerning the genes involved in the development and the advancement
of a cancer as well as the abnormality of such genes has been increasing, and the genetic diagnosis of a biopsy
material may serve for an early diagnosis and an assessment of the degree of malignancy of a cancer. However, since a
carcinogenic mechanism comprises multiple steps and requires an accumulation of a plurality of mutations, a large part
of the genes associated with the canceration is still unknown, and further study is necessary. Recently, a gene
therapy in which a normal p53 gene is introduced into a cancer cell, thereby suppressing the proliferation of the
cancer cell, has reached the stage of a clinical trial. Therefore, the elucidation of a cancer-suppressing gene can
shed light not only on the diagnosis but also on the gene therapy.

[0010] Accordingly, a first object of the present invention is to provide a method for detecting a cancerated cell and
a method for determining a degree of malignancy, on the basis of finding a gene usable as an index for carcinogenesis,
particularly a gene capable of changing expression conditions thereof by canceration of a cell, and measuring an
expression level of the gene in a resected specimen. A second object of the present invention is to provide a kit used
for the above method for detecting a cancer cell and/or a method for determining a degree of malignancy of the cell. A
third object of the present invention is to provide a method for controlling proliferation of a cancer cell by using a
substance specifically binding to a gene capable of serving as an index for carcinogenesis or an expression product of
the gene. Furthermore, a fourth object of the present invention is to provide a novel peptide associated with
canceration, and a nucleic acid encoding the peptide.

[0011] To summarize the present invention, a first invention of the present invention is an invention pertaining to a
method for detecting a cancer cell in a resected specimen, characterized by determining a change in an expression
level of a gene selected from genes of which cDNA corresponds to a DNA comprising a nucleotide sequence as shown
in any one of SEQ ID NOs: 1 to 16 in Sequence Listing, or a DNA capable of hybridizing with a nucleic acid as shown
in any one of SEQ ID NOs: 1 to 16 in Sequence Listing under stringent conditions by, for example, determining a change
of an expression level of mRNA or a change of a protein expression level.

[0012] A second invention of the present invention is an invention pertaining to a kit for detecting cancer by the
detection method of the present invention, characterized in that the kit comprises as an essential constituent any one
of primers for amplifying mRNA as an index for a change in an expression level, a probe capable of hybridizing with the
above mRNA, or an antibody recognizing a protein as an index for the change in expression level.

[0013] A third invention of the present invention is a method for controlling proliferation of a cancer cell by using a
substance specifically binding to the gene or an expression product thereof, characterized in that cDNA of the gene
corresponds to a DNA comprising a nucleotide sequence of any one of SEQ ID NOs: 1 to 16 in Sequence Listing, or a DNA
capable of hybridizing with DNA as shown in any one of SEQ ID NOs: 1 to 16 in Sequence Listing, wherein the method
gives transcriptional control of the gene and/or functional control of an expression product thereof, and the like.

[0014] A fourth invention of the present invention is an invention pertaining to a peptide usable for detecting cancer
and a nucleic acid encoding the peptide, characterized in that the peptide consists of an amino acid sequence
comprising an entire portion of an amino acid sequence as shown in SEQ ID NOs: 17 to 19 in Sequence Listing or a partial
portion thereof and a nucleic acid encoding the peptide.

[0015] A fifth invention of the present invention pertains to an antibody usable for detecting cancer, the antibody
recognizing the above peptide of the fourth invention.

[0016] Incidentally, the term "resected specimen" used in the present specification refers to blood, urine, feces,
tissue resected by surgery, and the like. On the other hand, the term "cancer-associated gene" refers to a gene in
which the expression conditions thereof change with canceration of a cell.

DISCLOSURE OF INVENTION

[0017] In order to achieve the objects mentioned above, the present inventors have found a cancer-associated gene
by comparing the intracellular expression levels of genes between a cancer tissue and a control normal tissue of a
cancer patient, and they have found that cancer cells can be detected by comparing the expression level of this gene.
In addition, they have found a novel gene among the cancer-associated genes, thereby completing the present invention.

[0018] The terms "cancer tissue" and "control normal tissue" used in the present specification mean, respectively, a
tissue constituting a region of cancerous lesion in a multicellular individual and a tissue constituting a region which
is identical spatially to the cancer tissue in the same individual but functions normally.
[0019]

Figure 1 is an autoradiogram showing electrophoretic patterns of the resulting DNA fragment in a case of detecting
cancer-associated genes by DD method.

Figure 2 is an autoradiogram obtained by electrophoresing RNA and then hybridizing a labeled probe with a
desired mRNA, in a case of detecting a change in an expression level of mRNA of cancer-associated genes by
Northern hybridization method.

Figure 3 is a picture showing electrophoretic patterns of the resulting DNA fragment in a case of detecting a change
of expression of a cancer-associated gene by RT-PCR method.

BEST MODE FOR CARRYING OUT THE INVENTION

[0020] The present invention will be explained concretely below.

[0021] The first invention of the present invention provides a method for detecting a cancer cell using an expression
[...] canceration is a gene capable of changing [...] thereof by canceration of a cell, namely, a gene of which
expression is significantly induced or [...] can be detected by, for instance, analyzing copy number of the gene on
genome or a pattern for translocation in chromosome, [...] an expression level of a gene product in a normal cell and
a cancerated cell to identify a gene [...] differences in both cells. The gene product includes, for example, mRNA
transcribed [...] protein which is a translational product. In the detection in the present invention for a cancer-
[...] to use as an index an expression level of mRNA, for which various methods have been developed for its analysis
with the progress of gene manipulation technique. Procedures for confirming a change in an expression level of a gene
using as an index an expression level of mRNA include subtractive hybridization method [..., Cell, 21, 709 - 715
(1989)], Representational Difference Analysis (RDA) method [Lisitsyn, N. et al., Science, 259, 946 - 951 (1993)],
molecular index method (Japanese Patent Laid-Open No. Hei 8-322598), Differential Display (DD) method [Liang, P. and
Pardee, A.B., Science, 257, 967 - 971 (1992)], and the like. Among them, since the procedures of the DD method are
simple, the DD method is suitable for screening a gene in the present invention. The method for screening a gene by
using the DD method utilized in the present invention will be described in detail.

First, mRNA is converted to cDNA by carrying out a reverse transcription reaction with a [...] DNA-removed crude RNA
sample, resulting from treating with DNase each RNA individually extracted from a cancer tissue and a control normal
tissue to be compared, together with an oligo(dT) anchor primer and a reverse transcriptase (RTase). Thereafter, the
nucleic acid amplification is carried out by polymerase chain reaction (PCR) with the oligo(dT) anchor primer in
combination with various random primers.
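
The pairing of each oligo(dT) anchor primer with each random primer can be sketched as below. This is only a minimal illustration in Python: the primer labels (`D1` to `D9`, `U1` to `U6`) and the counts of primers are assumptions chosen for the example, and the function name `primer_combinations` is hypothetical.

```python
from itertools import product


def primer_combinations(anchor_primers, random_primers):
    """Return every (anchor primer, random primer) pair to be run as one PCR."""
    return list(product(anchor_primers, random_primers))


# Hypothetical labels: nine anchor (downstream) primers and six random (upstream) primers.
anchors = [f"D{i}" for i in range(1, 10)]
randoms = [f"U{i}" for i in range(1, 7)]

pairs = primer_combinations(anchors, randoms)
print(len(pairs), "reactions per tissue; first pair:", pairs[0])
```

Each pair would be run once for the cancer tissue and once for the control normal tissue, so that band patterns from an identical primer pair can be compared side by side.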

Subsequently, a PCR-amplified product obtained separately from the tissues to be compared is subjected to
polyacrylamide gel electrophoresis for each amplified product resulting from a combination of an identical primer pair.
The band patterns are compared with each other to find a band exhibiting a difference between the normal cell and the
cancer cell. The band is cut out from the gel, and a nucleic acid contained in the band is extracted, whereby a DNA
fragment which is considered to be complementary to a partial portion of the mRNA for the cancer-associated gene can
[...]

Thereafter, there is studied whether changes in expression levels of mRNA for the cancer-associated gene
[...] from the DNA fragment obtained in the DD method described [...]

of the mRNA in a normal tissue is confirmed to be higher than that in the cancer tissue, it is determined that the
cancer-associated gene is a gene of which expression level is reduced owing to canceration. On the other hand, when
the expression level of the mRNA in the cancer tissue is confirmed to be higher than that in the normal tissue, it is
determined that the cancer-associated gene is a gene of which expression level is amplified owing to canceration.
[0026] The confirmation on an expression level of mRNA can be made, for example, by labeling the DNA fragment
obtained, subjecting a crude RNA sample extracted from either of the cancer tissue or the control normal tissue to
Northern hybridization using the above DNA fragment as a detection probe, and confirming the signal intensity from the
probe [...] the expression level of the mRNA is high. For example, a signal intensity can be expressed as a value for
a volume of a band [IOD (Integrated Optical Density)] obtained from an autoradiogram, or the like. Here, the higher the
IOD value, the higher the expression level of the mRNA corresponding to the band can be determined to be.
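
The decision rule described above (compare the band volume for the cancer tissue with that for the control normal tissue) might be sketched as follows. This is a hedged illustration only: the function name, the example IOD values, and the optional `min_fold` cutoff are assumptions introduced here and do not appear in the specification.

```python
def classify_expression_change(iod_cancer, iod_normal, min_fold=1.0):
    """Classify a gene from band volumes (IOD) measured in cancer and normal tissue.

    min_fold is a hypothetical fold-change cutoff; 1.0 means any difference counts.
    """
    if iod_cancer <= 0 or iod_normal <= 0:
        raise ValueError("IOD values must be positive")
    fold = iod_cancer / iod_normal
    if fold > min_fold:
        return "amplified owing to canceration"
    if fold < 1.0 / min_fold:
        return "reduced owing to canceration"
    return "no clear change"


# Made-up IOD values, for illustration only.
print(classify_expression_change(300.0, 100.0))
print(classify_expression_change(50.0, 200.0))
```

A cutoff above 1.0 would make the call more conservative; the appropriate value would have to be chosen from replicate measurements.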
[0027] When the expression level of mRNA is too low for the change in the expression level of the mRNA to be
confirmed by means of Northern hybridization analysis, the change can also be confirmed by the more sensitive RNase
protection assay [Krieg, P.A. and Melton, D.A., Methods in Enzymology, 155, 397 - 415 (1987)] using as a probe RNA
prepared from an amplified DNA fragment obtained by the DD method described above, which is derived from mRNA
deduced to be expressed from a cancer-associated gene as a template. This method utilizes RNase having substrate
specificity wherein it shows cleaving activity on single-stranded RNA, but shows no cleaving activity on double-stranded
RNA. Specifically, an excessive amount of a probe is added to a crude RNA sample extracted from a normal tissue and to
a cancer tissue-derived crude RNA sample, so that the mRNA to be detected forms a hybrid with the added probe, and the
RNase having substrate specificity is then allowed to act on the sample. The expression level of the mRNA can be
confirmed by determining the amount of the double-stranded RNA remaining after the digestion with the RNase mentioned
above. In other words, the larger the amount of the remaining double-stranded RNA, the higher the expression level of
the mRNA can be determined to be.

[0028] The nucleotide sequence of an amplified DNA fragment obtained by the DD method described above, which
is derived from mRNA deduced to be expressed from a cancer-associated gene as a template, is determined by PCR
direct sequencing [Erlich, H.A., PCR Technology, Stockton Press, Published in 1989, 45 - 60], or by a combination of
TA cloning [Mead, D.A. et al., Bio/Technology, 9, 657 - 663 (1991)] with a usual nucleotide sequencing method, and the
amounts of the amplified product as obtained by carrying out RT-PCR with an amplification primer which is designed
based on the above nucleotide sequence information are then compared, whereby the mRNA expression level can be
confirmed. In other words, the higher the amount of the resulting amplified product, the higher the expression level
of the mRNA can be determined to be.

[0029] Incidentally, the amplified DNA fragment obtained by the DD method described above, which is derived from
mRNA deduced to be expressed from a cancer-associated gene as a template, is not necessarily cDNA complementary
to an entire length of mRNA for the cancer-associated gene. In order to obtain cDNA for a cancer-associated gene,
for example, a cDNA library derived from a tissue used in screening is prepared; an amplified DNA fragment obtained
by the DD method described above, which is derived from mRNA deduced to be expressed from a cancer-associated
gene as a template, is labeled; and plaque hybridization is carried out with the labeled DNA fragment as a
detection probe, whereby a cDNA clone for a cancer-associated gene can be isolated.

[0030] The present inventors have succeeded in isolating 14 kinds of DNA fragments each comprising a nucleotide
sequence of a partial portion of cDNA for a cancer-associated gene. Genes expressing mRNA which corresponds
to cDNA as shown in nucleotide sequences comprising a nucleotide sequence for the DNA fragment thus obtained are named
CA11, CA13, CC24, GG24, AG26, GC31, GC32, GC33, GG33, CC34, GC35, GC36, CA42 and CC62, respectively.
Correspondences between the SEQ ID NOs in Sequence Listing, which show the nucleotide sequences of the regions presently
determined in the cDNA for each of the 14 kinds of cancer-associated genes, and the above names given to the genes
by the present inventors are shown in Table 1.



Table 1

                SEQ ID NOs in Sequence Listing
Name of Gene    Nucleotide Sequence    Amino Acid Sequence
CA11            1                      17
CA13            2                      18
CC24            3                      -
GG24            4                      -
AG26            5                      -
GC31            6                      -
GC32            7                      -
GC33            8                      -
GG33            9                      -
CC34            10                     -
GC35            11                     -
GC36            12, 15, 16             -
CA42            13                     19
CC62            14                     -
[0031] [...]

[...] gene is changed owing to [...] 1 to 16 [...] as long as the [...]

[...] prepared by dissolving sodium chloride [...] g and sodium citrate 4.41 g in 1 L of water [...]
[...] DNA, [...] bovine serum [...], 0.1% polyvinyl pyrrolidone and 0.1% Ficoll [...]

[...] as shown in SEQ ID NO: 10 in Sequence Listing is present in the nucleotide sequence of cDNA for CC34 [...]

[Normal Level Value] = [Mean Expression Level of Cancer-Associated Gene in
Normal Tissue] + 2 x [Standard Deviation]          Equation 1
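
Equation 1 can be evaluated with a few lines of Python. The sketch below is illustrative only: it assumes the sample standard deviation is intended (the text does not say which estimator is used), and the function names and the expression values are invented for the example.

```python
from statistics import mean, stdev


def normal_level_value(normal_levels):
    """Equation 1: mean expression level in normal tissue plus 2 x standard deviation.

    Assumption: the sample standard deviation (statistics.stdev) is used.
    """
    return mean(normal_levels) + 2 * stdev(normal_levels)


def exceeds_normal_level(sample_level, normal_levels):
    """True when a specimen's expression level is above the normal level value."""
    return sample_level > normal_level_value(normal_levels)


# Invented expression levels measured in normal tissues.
normal = [10.0, 12.0, 14.0]
print(normal_level_value(normal))          # mean 12.0, stdev 2.0 -> 16.0
print(exceeds_normal_level(17.0, normal))
```

With only a handful of normal specimens the standard deviation is poorly estimated, so the resulting cutoff should be read as approximate.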

« EL SdtST ^ mRNA '"*<"* to ^ m^, RNa Se a^y „ 

Incidentally, the term "primer" in the present specification refers to an oligonucleotide capable of acting as an
initiation [...] strength, cofactors, and the like). Typically, the primer [...] ionic
s gene in the present specification, there L be exZS as t h l *Ll ? inStanCe ' in 9 case of CA1 1 

SEQ ID NOs: 20 and 21 in Sequence Listino S™? ! ^ P 3 comb,nation of °NAs as shown in 
present specification refers to^^ci^Jp^'r" * ^ amP "' fied Pr0dUCtS USed in the 
DNA fragment to agarose gel etaSSSi. a™ 1^2. i ^'T' 0 " ' S anW ° Ut by subjectin 9 *• am P^«J 

» labeled probe capable ol h^Z ° }T ? P T*' " nd ** 5i ° nal in ' ensil » 01 • ^ *«*<l w» a 

determined that the expression level of a omcer^SLl « J^? k e ,f acted from a resected specimen, it can be 
partially limited. Fo exanple *er e can dTus Si t0 be detected 15 hi 9 h " 7116 'abel on the probe is not 
cent substance typically exeTp I*e7b S^^i^!^? by ^ ° r a fluore *- 

co^X ^ in 3 SUfffci - - product 

position of the amplified DNA fragment *" 96 ' With RBr ' and COnfirmin 9 fram *e 
bedeterminedthattheexpreS 

5 mine the expression -t is also possibie to deter- 

fluorescent intensity. OD 0f a band on a f '"orescent image instead of a 

applied in the step of L facd am^^ N °- Hei 5 " 504886 ) ™V be 

» method includes adding a known amoS^uJSS. h * . . ^. 9 ^ PUrp0Se msntioned above - A typical 
used in ampliation of a desTi gle and toSSSZ^ . ** termi " a,S the nUC ' e0tide s ^" ces 
amplifying by PCR reaction ^^1^^ ^ ^ se ^ uences and ™es as an internal standard and 
proSu* in the light he^^^^^ the final ampl « ied «~- * *e desired 

not limited to an externally a*wSSu?13 htl m , ? Pr6S6nt an internal is 

of a gene expressing in anornS ES^cIS ZS* by ^ 38 3 t6mp,ate mRNA 

be incjud* cDNA £ P -actin gene' ^.^^^."oXS^ AS "* * — - 

used as a primer pair for nucleic acid awoirZ^rLr^^ I NOs: 20 and 21 in Swce Listing are 
asshowninbJnumbers^^ 

CA1 1 gene in the present ^^^ J^!^^^ ^ ° f thS "ucteodd. sequences of a 
to be detected or a partial portotn^^ Wth mRNA for a ^^er-associated gene 

may, for example, be used '™£wS2L a S Si ?5 > k P ? 8 * n0t P3rtCUlar,y limited - and ther * . 
exempted by fluorescein tyP ' Ca " y eXemp " f,ed by P ' or a ,luores « n * substance typically 

lining on a nylon filter, or the l^b^lS^^^^T b " 8d ° n the m0,ecular wei ^ 
excess amount of a probe for deTedinS ZJ^-'"? 0 *" ^ t0 be deteCted int0 contact with an 

izing with the immobl^RN ^^^^££5^ h ? 9nal intenSity ° bt3ined from ,he prabe h ^ 
level of the cancer-associated genels high 9 9n8 ' * Ca ° be determined 1,16 - W" 
high.




4 0 fnnS Th ! ' '-labeled protein A, a peroxidase-linked anti-IgG antibody, and the like 

an b T nti ° n Pr0Vk,6S 3 " for de,6Ctin 9 3 Cancer worts, there 

in.l7n^ ? 9 3 CanC6r Ce " by utl ' li2ing the method for detecting a cancer cell which is the fitt 

IT of a Presen, ."^ ntl0n - Concretel * »«• can be exemplified a Wt for detecting the change 

i^v^ssjsar within cells using as " index an amount * mRNA - « ^ -~ 

" Son rnl^L 3 f ,f dete H tin9 3 , CanCer Ce " USi " 9 33 an ind6X an «P«*n level of mRNA by using the 

„Tj?T? S , a n ^ a " y - 0,8 Wn<ls an!l ,he n "* e ' 01 * e ^-associated genes B "e d«eCby SI 
» lhe pdmer "* « s * " *°> P**A limited Z«Z b^S 

K One S3 XT ^ n,,mBe, °* ^» «> be defect ' ^ 

~sCn=^^ 

bo, of a ccrtMon o. an aipnabe, and numerate indicates the name of the primer in*, p£nt" 
number within a parenthesis attached to each symbol indicates SEQ ID NO: in Sequence Listing showing the
nucleotide sequence of each primer. Incidentally, β-actin shown in Table 2 is a [...]

purpose of quantifying mRNA for the cancer-associated gene in a crude RNA sample [...]







Table 2

Target Gene    Primer Pair (SEQ ID NO)      Size of Amplified DNA (Predicted)
CA11           F1 (20)     R1 (21)          366 bp
CA13           F2 (22)     R2 (23)          168 bp
CC24           F3 (24)     R3 (25)          259 bp
GG24           F4 (26)     R4 (27)          384 bp
AG26           F5 (28)     R5 (29)          389 bp
GC31           F6 (30)     R6 (31)          213 bp
GC32           F7 (32)     R7 (33)          251 bp
GC33           F8 (34)     R8 (35)          563 bp
GG33           F9 (36)     R9 (37)          218 bp
CC34           F10 (38)    R10 (39)         241 bp
GC35           F11 (40)    R11 (41)         157 bp
GC36           F12 (42)    R12 (43)         95 bp
CA42           F13 (44)    R13 (45)         245 bp
CC62           F14 (46)    R14 (47)         134 bp
β-Actin        F15 (48)    R15 (49)         264 bp
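
The predicted sizes in Table 2 lend themselves to a simple check of whether a band observed after RT-PCR is the expected product. The sketch below is an assumption-laden illustration: the dictionary merely transcribes the predicted sizes from Table 2, while the function name, the key `beta-actin`, and the 10 bp tolerance are hypothetical choices made for the example.

```python
# Predicted amplified-DNA sizes (bp) transcribed from Table 2.
PREDICTED_SIZE_BP = {
    "CA11": 366, "CA13": 168, "CC24": 259, "GG24": 384, "AG26": 389,
    "GC31": 213, "GC32": 251, "GC33": 563, "GG33": 218, "CC34": 241,
    "GC35": 157, "GC36": 95, "CA42": 245, "CC62": 134, "beta-actin": 264,
}


def matches_predicted_size(target, observed_bp, tolerance_bp=10):
    """True when an observed band lies within tolerance_bp of the predicted size.

    tolerance_bp is a hypothetical allowance for gel-sizing error.
    """
    return abs(observed_bp - PREDICTED_SIZE_BP[target]) <= tolerance_bp


print(matches_predicted_size("CA11", 370))
print(matches_predicted_size("GC36", 120))
```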
[0058] On the other hand, in the case of a kit for detecting a cancer cell using as an index mRNA by using a detection
method employing RNase protection assay or Northern hybridization method, it is an essential requirement for a [...]

1™"° 1 T ""J ^ There, ° re ' 3 Pr ° be COnStitutin 9 Wt is not Particularly IfcffaSi^S^ 
S on » T ? T th ? ndS ^ thS nUmb6r ° f * e ^-associated genes to be detected 
SJS. f ' tfl6 0356 01 3 kit for detecting detection method w « h a" anybody it is an essential a can- 

cer cell us>ng a protem as an index by using the constituent to have an antibody which has the SwlSSESiSi 
above n connection wrth the method for detecting a cancer cell and is capable of binding inSS^SSSS^ 
iKET 6 ^ 35 ^ ° ne °' SEQ 10 N ° S: 1 10 16 in Sequence U^^^SS^ 

^^l^nTnSK? 3 nUC ' e0tide SeqUenC6 ° f ° NA CaPable ° f h ** d ™° -derstringenrcondi^s ^ t 
to ifi ™ cleobde u se <l uence ™W™q the nucleotide sequence as shown in any one of SEQ ID NOs- 1 

SJSJ? 1 ^* ° f the cancer ' associ ^ 9en eS to be detected by this kit are not partly limied 

ijdo n thl id rt ^ C ° n f ^ th ' S * iS n0t Partcularly limited ' and * ™V °e selected eppnJriawfcCSS 
upon the krnds and the number of the cancer-associated genes to be detected appending 

S JlT!!S SUC ^ Wt ' 3 C3nCer Ce " CaP b6 det6Ct6d ™ re simpl * Th « h ^ " is P^sible to diagnose a cancer 
based on the determ.ned expression level of a cancer-associated gene by using such a kit In other words hmians 

otte^m^^ 

The third invention of the present invention is a method for controlling proliferation of a cancer cell using a
substance specifically binding to a cancer-associated gene or an expression product thereof. The specific binding
substance referred to in the present specification can, for example, include nucleic acids, antibodies, cytotoxic T






1994) 

(CTL), and the like.

I^^u?T POnd8 10 P19 °" B 96ne [BUrbel °' PD - 6t aL - J - S/0/ - C " em - 2 70. 3091 9 - 30926 (1995)1 G Sl T 

eZl n ,Th 1 ^ SUbUnit " 9ene [P0W6r ' M D " et al - ^ ^ «es 17 6734 989 ■' Gtt2 cT 
responds to cytochrome b gene [Anderson, S. et al., Nature 290 457 - ars mqhim- rr„ ' 1 ! ,J ' 

subunit gene [Tamura. R.N. et al., Journa, of Cell B olSu 1593 - 1 6 S K- ?% 3 /° rreSp0, * S t0 '" te 9 rin * 6 
^^^^ 

J^liSSSSS •? i rrr acM h ? ""^ in,enfion ,he "~ 

- arsr - ,"f ,o — — * ^ * and ha * 9 a ^ * 






sssKdt:"^. m " amn ° aads bound 10 -* ° ta * ■»* - * ««- » j^h 



D NO. 1 ,n Sequence Lstmg; the peptide having the amino acid sequence as shown in SEQ IDNO 1 i 

Sn.V.T 6 "^ nUmb6rS 1698 t0 1850 0f ,he n ^« de «W as shl in SEQ 7 D N T 2 n 

Sequence Lstmg; the peptide having the amino acid sequence as shown inSEQ ID NO- 19 in S P n , ™ rr 

S suhSZ „ ^! 96 ' n ' tS ex P ress,on leve| . "herein one or more amino acids are added deleted 

or substrtuted ,n the ammo acd sequence as shown in any one of SEQ ID NOs: 17 to 19 in Sequence Ustint' ?££ 

thafeven whl 9 2 TT T? aC 'l Sequence have been P«*™* ™ere cannot, therefore, be dJdZ£2E 
hat even when the nucle.c acd encoding a particular amino acid sequence is isolated, a variety of nucleic aciSs enco? 

SS^^rs^-*: 7* t praduction of a protein by 9enetic ^^SSs^ 

onSrtnn I „ t h 8 C " 9, *** awton 0f the desired P rotein * thieved by artificially 

acXuent dil STn h^^ 
[0074] In fact, in the present invention, nucleic acids of which nucleotide sequences are slightly different but the
amino acid sequence encoded [...] is identical are obtained. Although R at base number 1784 is A, and K at base number
1985 is T in the nucleotide sequence as shown in SEQ ID NO: 2 in Sequence Listing [...]

[...] is G [...] 1985 is T; a nucleic acid having R at base number 1784 is A, and K at base number 19[...]
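
The symbols R and K used above are standard IUPAC ambiguity codes (R stands for A or G, K stands for G or T). The following sketch, offered only as an illustration, enumerates the concrete sequences covered by a sequence containing such codes; the toy sequence `ARK` is invented and is not taken from SEQ ID NO: 2, and the function name is hypothetical.

```python
from itertools import product

# Subset of the IUPAC nucleotide codes needed here: R = A or G, K = G or T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "K": "GT"}


def expand_ambiguous(seq):
    """List every concrete sequence represented by a sequence with R/K codes."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in seq.upper()))]


# Toy example with one R and one K position: four concrete variants.
print(expand_ambiguous("ARK"))
```

Two ambiguous positions with two possible bases each give four nucleic acid variants, which is why several slightly different nucleotide sequences can be reported under one SEQ ID NO.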

Example 1 Analysis of Cancer-Associated Genes

1) Confirmation of mRNA Which Can Serve As Index for Detecting Cancer

[...] it was confirmed whether or not mRNA of which expression level was changed by canceration was
[...] DD (manufactured by Takara Shuzo Co., Ltd.) in accordance with the procedures described in the [...]






[0081] Specifically, reverse transcription reaction was carried out per one reaction by mixing 200 ng of the crude RNA
[...] in Sequence Listing as a primer, thereafter heat-treating at 70°C for 10 minutes, subjecting to rapid cooling,
and subsequently reacting with AMV reverse transcriptase at 55°C for 30 minutes. Other downstream primers were
individually reacted in the same manner to prepare 9 kinds of [...]

[...] amplification reaction by PCR, a nucleic acid amplification was carried out by PCR [...] single-stranded cDNA
[...] described above as a template, an oligo(dT) primer [...] by adding 3 mM MgCl2, 15 [...] each of dATP, dGTP, dCTP
and dTTP as substrates, [...] (manufactured by Amersham) as a labelling compound, and reacting for 40 cycles, wherein
[...] consists of at 94°C for 30 seconds, at 40°C for 60 seconds, and at 72°C for 60 seconds [...] denaturation
at 90°C for 2 minutes to obtain a sample for electrophoresis. The electrophoresis was carried out on a 7 M urea-
[0084] As one example, the results where D4 having the nucleotide sequence as shown in SEQ ID NO: 59 in
[...] from a normal tissue of a [...]

tissue for each of genes which was detected by the [...]



Table 3

                Primer Pair                  Approximate Size of        Difference in Amount of
Name of Gene    Upstream     Downstream     Amplified DNA fragment     DNA fragment
CA11            U1 (50)      D4 (59)        750 bp                     Cancer Tissue < Normal Tissue
CA13            U1 (50)      D4 (59)        620 bp                     Cancer Tissue > Normal Tissue
CC24            U2 (51)      D5 (60)        800 bp                     Cancer Tissue > Normal Tissue
GG24            U2 (51)      D9 (64)        480 bp                     Cancer Tissue > Normal Tissue
AG26            U2 (51)      D3 (58)        550 bp                     Cancer Tissue < Normal Tissue
GC31            U3 (52)      D8 (63)        440 bp                     Cancer Tissue > Normal Tissue
GC32            U3 (52)      D8 (63)        310 bp                     Cancer Tissue > Normal Tissue
GC33            U3 (52)      D8 (63)        300 bp                     Cancer Tissue > Normal Tissue
GG33            U3 (52)      D9 (64)        410 bp                     Cancer Tissue > Normal Tissue
CC34            U3 (52)      D5 (60)        290 bp                     Cancer Tissue > Normal Tissue
GC35            U3 (52)      D8 (63)        210 bp                     Cancer Tissue < Normal Tissue
GC36            U3 (52)      D8 (63)        190 bp                     Cancer Tissue < Normal Tissue
CA42            U4 (53)      D4 (59)        660 bp                     Cancer Tissue > Normal Tissue
CC62            U6 (55)      D5 (60)        380 bp                     Cancer Tissue < Normal Tissue



2) Identification of mRNA Serving as Index for Detecting Cancer 




[0086] There was studied whether a change in an expression level of the mRNA used as a template for an amplified
DNA fragment shown in Table 3 was confirmed as by the DD method in Section 1).

[0087] First, the studies were made by means of Northern hybridization. Specifically, it was studied whether the
difference in the expression levels of the mRNA of a cancer-associated gene expressed in a cancer tissue and that in
a control normal tissue could be detected by using each amplified DNA fragment obtained by the method in Section 1)
described above as a probe.

[0088] The probe for the detection was prepared as follows. Specifically, from the acrylamide gel on which the
amplified DNA fragment obtained by the DD method in Section 1) described above was electrophoresed, the region
containing each amplified DNA fragment shown in Table 3 was cut out, 100 µl of water was added thereto, and the
mixture was subjected to heat extraction to collect each DNA fragment individually. Re-amplification by PCR was
carried out by using each DNA fragment individually as a template, with the combination of the upstream and
downstream primers used to obtain each DNA fragment shown in Table 3. Further, about 100 ng of each amplified DNA
fragment was labeled with 32P using Random Primer DNA Labeling Kit (manufactured by Takara Shuzo Co., Ltd.) to
prepare 14 kinds of probes for detection. Separately, mRNA for the β-actin gene was selected as a positive control
for the crude RNA extracted from each tissue, and the synthetic oligonucleotide having the nucleotide sequence as
shown in SEQ ID NO: 65 in Sequence Listing was labeled with 32P in the same manner to obtain a probe for detecting
mRNA for the β-actin gene. Thereafter, each probe for detection described above was mixed with herring sperm DNA so
as to have a concentration of 100 µg/ml, and then heat-denatured. To the resulting mixture was added hybridization
buffer (50% formamide, 0.65 M NaCl, 0.1 M Na-Pipes, 5 x Denhardt's reagent, 0.1% SDS, 5 mM EDTA) to prepare 15 kinds
of probe solutions for detection in Northern hybridization.

[0089] Northern hybridization was carried out as follows. First, 20 ng per well of a crude RNA sample extracted from
each of a cancer tissue and a control normal tissue from the patient with a gastric cancer, prepared as described
above, was subjected to electrophoresis on a formalin-denatured 1% agarose gel and blotted onto a Hybond N+ membrane
(manufactured by Amersham). Subsequently, the blotted membrane and hybridization buffer supplemented with
heat-denatured herring sperm DNA to a final concentration of 100 µg/ml were placed in a Hybri Bag (manufactured by
COSMO BIO). The bag was allowed to stand at 42°C for 2 hours, and then the buffer was discarded to give a
pre-hybridized membrane. After 15 such membranes were prepared, each of the 15 kinds of detection probe solutions for
Northern hybridization described above was added to a separate membrane, and hybridization was carried out at 42°C
for 16 hours. Thereafter, each blotted membrane was taken from the Hybri Bag and rinsed with washing solution I
(2 x SSC, 0.2% sodium pyrophosphate, 0.1% SDS) at 42°C for 20 minutes, and then with washing solution II (0.5 x SSC,
0.2% sodium pyrophosphate, 0.1% SDS) at 42°C for 20 minutes. Rinsing with washing solution II was repeated twice,
with the washing solution replaced each time. After rinsing, the membrane was wrapped with a plastic film and exposed
to a high-sensitivity X-ray film (manufactured by Kodak) for one day and night. From the signal intensity in the
resultant autoradiogram, the expression level in the cancer tissue was compared with that in the control normal
tissue.

[0090] As one example, the results of the detection of mRNA for the CA11 gene are shown in Figure 2. In Figure 2, 1N
is a lane in which a crude RNA sample obtained from a normal tissue of a patient with a poorly-differentiated
adenocarcinoma-type gastric cancer was electrophoresed on an agarose gel, and 1T is a lane in which a crude RNA
sample obtained from a cancer tissue of the same patient was electrophoresed on an agarose gel. (a) shows results
obtained with a probe for detecting CA11, and (b) shows results obtained with a probe for detecting β-actin. Since
both 1N and 1T exhibited signals with the probe for detecting β-actin as shown in (b), it is clear that in both
samples the RNA was extracted without undergoing excessive degradation. On the other hand, as shown in (a), a clear
signal at a position near 1.1 kb (marked in the figure) was present only in lane 1N, and no signal was present in
lane 1T. Therefore, it was found that CA11 was a gene of which expression level was reduced owing to canceration.
Similarly, CC62 exhibited a band at about 2.6 kb only on the autoradiogram derived from the control normal stomach
tissue. GC31, GC32 and CC34 showed bands at about 1.0 kb, about 1.6 kb and about 1.7 kb, respectively, and for each
of these genes a more intense signal was obtained for the crude RNA samples prepared from the gastric cancer tissues
than for those prepared from the control normal stomach tissues. Incidentally, the signal intensity was determined by
measuring each band of an autoradiogram with a densitometer. Subsequently, the IOD of each band obtained on the
autoradiogram was calculated with FMBIO-100 (manufactured by Hitachi Soft Engineering), and an index was calculated
by the equation shown below to determine whether or not a gene was a cancer-associated gene.

Equation 2:    [Index Value] = (X × pY)/(Y × pX)


[0091] In the above equation, each symbol expresses the following value:

X: IOD of a band derived from mRNA for the gene shown in Table 3 obtained from a gastric cancer tissue;
Y: IOD of a band derived from mRNA for the gene shown in Table 3 obtained from a control normal stomach tissue;
pX: IOD of a band derived from mRNA for β-actin gene obtained from a gastric cancer tissue; and
pY: IOD of a band derived from mRNA for β-actin gene obtained from a control normal stomach tissue.

[0092] For each of the genes CA13, CC24, GG24, AG26, GC33, GG33, GC35, GC36 and CA42, for which no signals were
obtained by Northern hybridization, the expression levels were compared by carrying out RT-PCR. In order to design
primers for the nucleic acid amplification reaction in the RT-PCR, each DNA fragment used as a probe in Northern
hybridization was subjected to direct sequencing by PCR, or was cloned by a TA cloning procedure and then sequenced
by a dideoxy method, thereby determining its nucleotide sequence. The nucleotide sequences of the primers designed
based on the resulting nucleotide sequence information and used in the RT-PCR with mRNA derived from each of the
genes as a template are as shown in any of SEQ ID NOs: 22 to 29, 34 to 37 and 38 to 43 in Sequence Listing. Table 2
shows the genes together with the corresponding primers used to confirm their expression.

[0093] To confirm a change in an expression level of mRNA by RT-PCR, a crude RNA sample obtained from each of the
cancer tissue and the control normal tissue of a patient with a gastric cancer, prepared by the method in Section 1)
described above, was first treated with DNase I. Thereafter, RT-PCR was carried out in a 100 µl reaction system with
40 ng of each treated sample using TaKaRa RNA PCR Kit Ver. 2.1 according to the procedures described in the
instruction attached to the kit. Specifically, 40 ng of a crude RNA sample as a template and an oligo(dT) primer
(final concentration: 2.5 µM) as a downstream primer were used to prepare a reverse transcription reaction mixture
(10 mM Tris-HCl, pH 8.3, 50 mM KCl, 5 mM MgCl2, 1 mM each of dNTPs, 100 units of RNase inhibitor, 25 units of AMV
reverse transcriptase), and the reverse transcription reaction was carried out at 30°C for 10 minutes, at 55°C for
20 minutes and then at 95°C for 5 minutes. A 10 µl aliquot of the reverse transcription reaction mixture was added to
40 µl of each of 10 kinds of PCR reaction mixtures (final concentration: 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 2.5 mM
MgCl2, 1.25 units of TaKaRa Taq DNA polymerase), each containing the primer pair (0.2 µM) for detecting the mRNA for
one of the genes CA13, CC24, GG24, AG26, GC33, GG33, GC35, GC36, CA42 and β-actin, to make up a volume of 50 µl.
After pre-incubation at 94°C for 2 minutes, each PCR cycle consisted of incubation at 94°C for 30 seconds, at 55°C
for 60 seconds, and then at 72°C for 60 seconds. The amount of an amplified DNA product was quantified by subjecting
the amplified DNA product to agarose gel electrophoresis, staining the gel with ethidium bromide, and calculating the
IOD of each band on the fluorescent image with FMBIO-100; from these values, an index for determining whether or not
a gene is a cancer-associated gene was obtained from Equation 2 shown above.
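
As an illustration only, the calculation of Equation 2 can be sketched in Python as follows. The function names, the treatment of an index of exactly 1 as "unchanged", and the sample IOD readings are assumptions introduced for this sketch and do not appear in the specification; X, Y, pX and pY have the meanings given for Equation 2, and the threshold of 1 is the one described for Table 4.

```python
def index_value(x, y, p_x, p_y):
    """Equation 2: (X * pY) / (Y * pX).

    x, y     : IOD of the gene band in cancer tissue / control normal tissue
    p_x, p_y : IOD of the beta-actin band in cancer tissue / control normal tissue
    """
    if y <= 0 or p_x <= 0:
        raise ValueError("Y and pX must be positive IOD readings")
    return (x * p_y) / (y * p_x)


def expression_change(index):
    """Index > 1 means increased, index < 1 means reduced (exactly 1 is an assumed 'unchanged')."""
    if index > 1:
        return "increased"
    if index < 1:
        return "reduced"
    return "unchanged"


# Hypothetical IOD readings for illustration only (not measured values).
example_index = index_value(x=1200.0, y=400.0, p_x=900.0, p_y=900.0)
print(example_index, expression_change(example_index))
```

Dividing by the β-actin readings normalizes for differences in the amount of RNA loaded or amplified from each tissue, so that the index reflects the change in the gene of interest rather than the change in total RNA.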

[0094] The results of the Northern hybridization method and RT-PCR described above, and the patterns of the changes
in expression owing to canceration of each of the genes evident from these results, are shown in Table 4. In the
column of the patterns of the changes in the expression, a gene of which expression was increased owing to
canceration is indicated with "↑", and a gene of which expression was suppressed owing to canceration is indicated
with "↓". Specifically, it was determined in Table 4 that a gene having an index value greater than 1 is a gene of
which expression level was increased owing to canceration, and a gene having an index value less than 1 is a gene of
which expression level was reduced owing to canceration. As a result, it was clarified that the genes CA13, CC24,
GG24, GC31, GC32, GC33, GG33, CC34 and CA42 were those of which expression levels were increased owing to
canceration, and the genes CA11, AG26, GC35, GC36 and CC62 were those of which expression levels were reduced owing
to canceration.



Table 4

Name of Gene    Index Value    Method for Determining    Patterns of Changes
                               Index Value               in Expression

CA11            0.036          A                         ↓
CA13            6.3            B                         ↑
CC24            2.0            B                         ↑
GG24            2.8            B                         ↑
AG26            0.52           B                         ↓
GC31            3.1            A                         ↑
GC32            3.6            A                         ↑
GC33            2.3            B                         ↑
GG33            2.2            B                         ↑
CC34            15             A                         ↑
GC35            0.0046         B                         ↓
GC36            0.048          B                         ↓
CA42            1.9            B                         ↑
CC62            0.56           A                         ↓

(note) In the table, "A" represents a determination from the autoradiogram in Northern hybridization, and "B"
represents a determination based on the electrophoretic photograph of the amplified product by RT-PCR.
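
The grouping of the genes by index value can be reproduced with a short Python sketch. The index values below are transcribed from Table 4; the variable names are introduced here for illustration only.

```python
# Index values transcribed from Table 4 (gene name -> index value).
TABLE_4_INDEX = {
    "CA11": 0.036, "CA13": 6.3, "CC24": 2.0, "GG24": 2.8, "AG26": 0.52,
    "GC31": 3.1, "GC32": 3.6, "GC33": 2.3, "GG33": 2.2, "CC34": 15,
    "GC35": 0.0046, "GC36": 0.048, "CA42": 1.9, "CC62": 0.56,
}

# Index > 1: expression increased owing to canceration; index < 1: reduced.
increased = [gene for gene, value in TABLE_4_INDEX.items() if value > 1]
reduced = [gene for gene, value in TABLE_4_INDEX.items() if value < 1]

print("increased:", increased)
print("reduced:", reduced)
```

Under this rule nine genes fall in the increased group and five in the reduced group, matching the two lists given in the text.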



3) Acquisition of Cancer-Associated Gene cDNA






^ .... t rt CM! - 80 ,ha * lh " e 0 s ™ 5 *« e determined to be novel oenes Further n a 

GG24 on V " 96ne ' GC32 10 Cyt0Chr ° me b 96ne - GC33 10 int ^ in « 6 sZJgl 

Sin aSrnll^nMrr'^ 9 °' "V 0 ™ US ' n9 35 3 pr0be an am P li,ied DNA ^t derived from 
« T^ln^ ! S . 1 ^ ° ne haV ' nQ th6 nUCl60tide sequence as shown in SEQ ID NO: 1 0 in Sequence UrtE^ 

W«™ 1 5 nUCle0tlde Sequence in whlch a * base number 935 in the nucleotide as shown in SEQ ID NO- 10 in 
Example 2 Confirmation of Change in Gene Expression in Cancer Tissue

1) Confirmation of Change in Gene Expression in Cancer Tissue of Patient with Signet Ring Cell Gastric Cancer

[0099] Using a crude RNA sample prepared in the same manner as in Section 1) of Example 1 from each of a cancer
tissue and a control normal tissue excised from a patient with a signet ring cell gastric cancer, who was different
from the patient who provided the tissues used in Sections 1) and 2) of Example 1, the expression levels in the
cancer tissue and the normal tissue were compared with respect to each of the 14 kinds of cancer-associated genes
clarified in Section 3) of Example 1, using the expression level of the mRNA as an index, by carrying out the
Northern hybridization or RT-PCR described in Section 2) of Example 1. As one example, the results of the detection
of mRNA for the CA11 gene by the RT-PCR method are shown in Figure 3. Specifically, Figure 3 is a photograph of a
fluorescent image of the electrophoresis of a DNA fragment obtained when a change in an expression level of a
cancer-associated gene is detected by the RT-PCR method. The reaction conditions of the RT-PCR were according to the
method described in Section 2) of Example 1, with the number of PCR cycles set to two patterns, i.e., 25 and 30. In
Figure 3, (a) shows the results of the detection of the expression of the cancer-associated gene CA11, and (b) shows
the results of the confirmation of the expression of β-actin as a positive control. In Figure 3, 2T is an amplified
DNA fragment obtained by using as a template a crude RNA sample extracted from a gastric cancer tissue of the patient
with a signet ring cell gastric cancer, and 2N is an amplified DNA fragment obtained by using as a template a crude
RNA sample extracted from a normal gastric tissue of the same patient. Also, the numerals "25" and "30" in Figure 3
are the numbers of cycles of the nucleic acid amplification in the RT-PCR method. Table 5 shows the calculated IODs
of the bands on the fluorescent image shown in Figure 3. Incidentally, each index shown in Table 5 was calculated
from Equation 2 described in Section 2) of Example 1.



Table 5

Number of Cycles    25                   30
Sample Name         2T       2N          2T        2N

CA11                365      31118       6345      61742
β-Actin             710      562         25115     20425
Index Value         0.0093               0.083



[0100] In Table 5, since the IOD values of the bands derived from β-actin obtained on the fluorescent images of 2T
and 2N were of a similar level at both 25 and 30 PCR cycles, it was clarified that RNA could be extracted from all
samples. Moreover, since the index was less than 1 for both 25 and 30 cycles of the PCR, it was clarified that CA11
was a gene of which expression level was reduced owing to canceration also in a patient with a signet ring cell
gastric cancer. With respect to the 13 kinds of cancer-associated genes other than CA11, a change in the expression
level was found in the same manner as in Section 2) of Example 1, so that it was clarified that the change in the
expression level of each of the 14 kinds of genes clarified in Section 3) of Example 1 was not a change peculiar to
the tissue of the patient tested in Section 1) of Example 1.
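
As a worked check, the index values of Table 5 can be recomputed from the tabulated IODs with Equation 2. The Python sketch below uses the IOD readings transcribed from Table 5; the data layout and names are assumptions made for illustration.

```python
# IOD readings transcribed from Table 5: cycles -> band -> (2T, 2N),
# where 2T is the cancer tissue sample and 2N the normal tissue sample.
TABLE_5_IOD = {
    25: {"CA11": (365, 31118), "beta_actin": (710, 562)},
    30: {"CA11": (6345, 61742), "beta_actin": (25115, 20425)},
}


def index_value(x, y, p_x, p_y):
    """Equation 2: (X * pY) / (Y * pX)."""
    return (x * p_y) / (y * p_x)


indices = {}
for cycles, bands in TABLE_5_IOD.items():
    x, y = bands["CA11"]            # gene band: cancer, normal
    p_x, p_y = bands["beta_actin"]  # control band: cancer, normal
    indices[cycles] = index_value(x, y, p_x, p_y)
    print(cycles, "cycles:", f"{indices[cycles]:.4f}")
```

The recomputed ratios are roughly 0.0093 at 25 cycles and 0.0836 at 30 cycles, in line with the tabulated 0.0093 and 0.083, and both are well below 1.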

Example 3 



Construction of Kit for Detecting Cancer 



[0101] A kit for detecting a cancer utilizing RT-PCR method comprising the following components was constructed. 
[0102] Specifically, the kit comprises DNase I, AMV reverse transcriptase, RNase inhibitor, 10 x RT-PCR buffer
(100 mM Tris-HCl, pH 8.3, 500 mM KCl), 25 mM MgCl2, a mixture of 2.5 mM each of dATP, dGTP, dCTP and dTTP, an
oligo(dT) primer, Taq DNA polymerase, a primer pair specific to each of the genes, and a primer pair for amplifying
the β-actin gene as a positive control, as shown in Table 2. In the column of the primer pair in Table 2, a symbol
combining a letter and a numeral indicates the name of a primer, and the number within parentheses following each
symbol indicates the SEQ ID NO: showing the nucleotide sequence of the primer in Sequence Listing.

INDUSTRIAL APPLICABILITY 



[0103] According to the present invention, it is made possible to simply and rapidly detect cancer. In addition, the pres- 
ence of a novel nucleic acid associated with the cancer is elucidated. 




EQUIVALENT 

SEQUENCE LISTING 

SEQ ID NO: 1 
SEQUENCE LENGTH: 738 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULAR TYPE: cDNA to mRNA 
SEQUENCE DESCRIPTION: 

CCTCTGTCCA CTGCTTTCGT GAAGACAAGA TGAAGTTCAC AATTGTCTTT GCTGGACTTC 60 
TTGGAGTCTT TCTAGCTCCT GCCCTTGCTA ACTATAATAT CAACGTCAAT GATGACAACA 120 
ACAATGCTGG AAGTGGGCAG CAGTCAGTGA GTGTCAACAA TGAACACAAT GTGGCCAATG 180 
TTGACAATAA CAACGGATGG GACTCCTGGA ATTCCATCTG GGATTATGGA AATGGCTTTG 240 
CTGCAACCAG ACTCTTTCAA AAGAAGACAT GCATTGTGCA CAAAATGAAC AAGGAAGTCA 300 
TGCCCTCCAT TCAATCCCTT GATGCACTGG TCAAGGAAAA GAAGCTTCAG GGTAAGGGAC 360 
CAGGAGGACC ACCTCCCAAG GGCCTGATGT ACTCAGTCAA CCCAAACAAA GTCGATGACC 420 
TGAGCAAGTT CGGAAAAAAC ATTGCAAACA TGTGTCGTGG GATTCCAACA TACATGGCTG 480 
AGGAGATGCA AGAGGCAAGC CTGTTTTTTT ACTCAGGAAC GTGCTACACG ACCAGTGTAC 540 
TATGGATTGT GGACATTTCC TTCTGTGGAG ACACGGTGGA GAACTAAACA ATTTTTTAAA 600 
GCCACTATGG ATTTAGTCGT CTGAATATGC TGTGCAGAAA AAATATGGGC TCCAGTGGTT 660 
TTTACCATGT CATTCTGAAA TTTTTCTCTA CTAGTTATGT TTGATTTCTT TAAGTTTCAA 720 
TAAAATCATT TAGCATTG 738

SEQ ID NO: 2 

SEQUENCE LENGTH: 2042 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULAR TYPE: cDNA to mRNA 

SEQUENCE DESCRIPTION: 

CCGTGACAAC ACTCCTGTCA TATTGGAGTC CAAAACTTGA ATTCTGGGTT GAATTTTTTA 60 
AAAATCAGGT ACCACTTGAT TTCATATGGG AAATTGAAGC AGGAAATATT GAGGGCTTCT 120 
TGATCACAGA AAACTCAGAA GAGATAGTAA TGCTCAGGAC AGGAGCGGCA GCCCCAGAAC 180 
AGGCCACTCA TTTAGAATTC TAGTGTTTCA AAACACTTTT GTGTGTTGTA TGGTCAATAA 240
CATTTTTCAT TACTGATGGT GTCATTCACC CATTAGGTAA ACATTCCCTT TTAAATGTTT 300
GTTTGTTTTT TGAGACAGGA TCTCACTCTG TTGCCAGGGC TGTAGTGCAG TGGTGTGATC 360 
ATAGCTCACT GCAACCTCCA CCTCCCAGGC TCAAGCCTCC CGAATAGCTG GGACTACAGG 420 
CGCACACCAC CATCCCCGGC TAATTTTTGT ATTTTTTGTA GAGACGGGGT TTTGCCATGT 480 
TGCCAAGGCT GGTTTCAAAC TCCTGGACTC AAGAAATCCA CCCACCTCAG CCTCCCAAAG 540 
TGCTAGGATT ACAGGCATGA GCCACTGCGC CCAGCCCTTA TAAATTTTTG TATAGACATT 600 
CCTTTGGTTG GAAGAATATT TATAGGCAAT ACAGTCAAAG TTTCAAAATA GCATCACACA 660 
AAACATGTTT ATAAATGAAC AGGATGTAAT GTACATAGAT GACATTAAGA AAATTTGTAT 720 
GAAATAATTT AGTCATCATG AAATATTTAG TTGTCATATA AAAACCCACT GTTTGAGAAT 780 
GATGCTACTC TGATCTAATG AATGTGAACG TGTAGATGTT TTGTGTGTAT TTTTTTAAAT 840 
GAAAACTCAA AATAAGACAA GTAATTTGTT GATAAATATT TTTAAAGATA ACTCAGCATG 900 
TTTGTAAAGC AGGATACATT TTACTAAAAG GTTCATTGGT TCCAATCACA GCTCATAGGT 960 
AGAGCAAAGA AAGGGTGGAT GGATTGAAAA GATTAGCNTN TGTNTCGGTG GCAGGTTCCC 1020 
ACNTCGCAAG CAATTGGAAA CAAAANTTTN GGGGAGTTTT ATTTTGCATT NGGGTGTGTT 1080 
TTATGTTAAG CAAAACATAN TTTAGAANCA AATGAAAAAG GCAATTGAAA ATCCCAGNTA 1140 
TTTCACCTAG ATGGNATAGC CACCNTGAGC AGAACTTNGT GATGNTTCAT TCTGNGGAAT 1200 
TTTGTGCTTN CTACTGTATA GTGCATGTGG TGTAGGTTAC TCTAACTGGT TTTGTNGACG 1260 
TAAACATTTA AAGTGTTATA TTTTTTATAA AAATGTTTAT TTTTAATGAT ATGAGAAAAA 1320 
TTTTGTTAGG CCACAAAAAC ACTGCACTGT GAACATTTTA GAAAAGGTAT GTCAGACTGG 1380 
GATTAATGAC AGCATGATTT TCAATGACTG TAAATTGCGA TAAGGAAATG TACTGATTGC 1440 
CAATACACCC CACCCTCATT ACATCATCAG GACTTGAAGC CAAGGGTTAA CCCAGCAAGC 1500 
TACAAAGAGG GTGTGTCACA CTGAAACTCA ATAGTTGAGT TTGGCTGTTG TTGCAGGAAA 1560 
ATGATTATAA CTAAAAGCTC TCTGATAGTG CAGAGACTTA CCAGAAGACA CAAGGAATTG 1620 
TACTGAAGAG CTATTACAAT CCAAATATTG CCGTTTCATA AATGTAATAA GTAATACTAA 1680 
TTCACAGAGT ATTGTAAATG GTGGATGACA AAAGAAAATC TGCTCTGTGG AAAGAAAGAA 1740 
CTGTCTCTAC CAGGGTCAAG AGCATGAACG CATCAATAGA AAGRACTCGG GGAAACATCC 1800 
CATCAACAGG ACTACACACT TGTATATACA TTCTTGAGAA CACTGCAATG TGAAAATCAC 1860 
GTTTGCTATT TATAAACTTG TCCTTAGATT AATGTGTCTG GACAGATTGT GGGAGTAAGT 1920 
GATTCTTCTA AGAATTAGAT ACTTGTCACT GCCTATACCT GCAGCTGAAC TGAATGGTAC 1980 
TTCGKATGTT AATAGTTGTT CTGATAAATC ATGCAATTAA AATAAAGTGA TGCAACATCT 2040 
TG 2042 
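
The listing lines can be checked mechanically: each line carries groups of bases followed by the position of its last base. The Python sketch below, given for illustration only, recovers the 1-based start position of a line and reports any IUPAC ambiguity codes on it (for example R, meaning A or G, and K, meaning G or T). The function names are assumptions; the three lines are copied from SEQ ID NO: 2 above.

```python
# IUPAC nucleotide ambiguity codes (anything other than A, C, G, T).
IUPAC_AMBIGUITY_CODES = set("RYKMSWBDHVN")


def parse_listing_line(line):
    """Split a listing line into (start position, end position, bases)."""
    tokens = line.split()
    end = int(tokens[-1])             # trailing number = position of last base
    bases = "".join(tokens[:-1])      # remaining tokens are the base groups
    start = end - len(bases) + 1      # 1-based position of the first base
    return start, end, bases


def ambiguous_positions(line):
    """Return (position, code) pairs for ambiguity codes on one listing line."""
    start, _, bases = parse_listing_line(line)
    return [(start + i, base) for i, base in enumerate(bases)
            if base in IUPAC_AMBIGUITY_CODES]


line_1800 = "CTGTCTCTAC CAGGGTCAAG AGCATGAACG CATCAATAGA AAGRACTCGG GGAAACATCC 1800"
line_2040 = "TTCGKATGTT AATAGTTGTT CTGATAAATC ATGCAATTAA AATAAAGTGA TGCAACATCT 2040"
line_2042 = "TG 2042"

print(ambiguous_positions(line_1800))     # R at base number 1784
print(ambiguous_positions(line_2040))     # K at base number 1985
print(parse_listing_line(line_2042)[0])   # final line starts at base 2041
```

A start position that does not follow directly from the previous line's end position would point to a dropped or inserted character in the transcription.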



SEQ ID NO: 3 
SEQUENCE LENGTH: 1539
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear 
MOLECULAR TYPE: cDNA to mRNA 
SEQUENCE DESCRIPTION: 

ATGTTCGCCG ACCGTTGACT ATTCTCTACA AACCACAAAG ACATTGGAAC ACTATACCTA 60 
TTATTCGGCG CATGAGCTGG AGTCCTAGGC ACAGCTCTAA GCCTCCTTAT TCGAGCCGAG 120 
CTGGGCCAGC CAGGCAACCT TCTAGGTAAC GACCACATCT ACAACGTTAT CGTCACAGCC 180 
CATGCATTTG TAATAATCTT CTTCATAGTA ATACCCATCA TAATCGGAGG CTTTGGCAAC 240
TGACTAGTTC CCCTAATAAT CGGTGCCCCC GATATGGCGT TCCCCCGCAT AAACAACATA 300 
AGCTTCTGAC TCTTACCTCC CTCTCTCCTA CTCCTGCTCG CATCTGCTAT AGTAGAGGCC 360 
GGAGCAGGAA CAGGTTGAAC AGTCTACCCT CCCTTAGCAG GGAACTACTC CCACCCTGGA 420 
GCCTCCGTAG ACCTAACCAT CTTCTCCTTA CACCTAGCAG GTGTCTCCTC TATCTTAGGG 480 
GCCATCAATT TCATCACAAC AATTATCAAT ATAAAACCCC CTGCCATAAC CCAATACCAA 540 
ACGCCCCTCT TCGTCTGATC CGTCCTAATC ACAGCAGTCC TACTTCTCCT ATCTCTCCCA 600 
GTCCTAGCTG CTGGCATCAC TATACTACTA ACAGACCGCA ACCTCAACAC CACCTTCTTC 660 
GACCCCGCCG GAGGAGGAGA CCCCATTCTA TACCAACACC TATCCTGATT TTTCGGTCAC 720 
CCTGAAGTTT ATATTCTTAT CCTACCAGGC TTCGGAATAA TCTCCCATAT TGTAACTTAC 780 
TACTCCGGAA AAAAAGAACC ATTTGGATAC ATAGGTATGG TCTGAGCTAT GATATCAATT 840 
GGCTTCCTAG GGTTTATCGT GTGAGCACAC CATATATTTA CAGTAGGAAT AGACGTAGAC 900 
ACACGAGCAT ATTTCACCTC CGCTACCATA ATCATCGCTA TCCCCACCGG CGTCAAAGTA 960 
TTTAGCTGAC TCGCCACACT CCACGGAAGC AATATGAAAT GATCTGCTGC AGTGCTCTGA 1020 
GCCCTAGGAT TCATCTTTCT TTTCACCGTA GGTGGCCTGA CTGGCATTGT ATTAGCAAAC 1080 
TCATCACTAG ACATCGTACT ACACGACACG TACTACGTTG TAGCTCACTT CCACTATGTC 1140 
CTATCAATAG GAGCTGTATT TGCCATCATA GGAGGCTTCA TTCACTGATT TCCCCTATTC 1200 
TCAGGCTACA CCCTAGACCA AACCTACGCC AAAATCCATT TCGCTATCAT ATTCATCGGC 1260 
GTAAATCTAA CTTTCTTCCC ACAACACTTT CTCGGCCTAT CCGGAATGCC CCGACGTTAC 1320 
TCGGACTACC CCGATGCATA CACCACATGA AATATCCTAT CATCTGTAGG CTCATTCATT 1380 
TCTCTAACAG CAGTAATATT AATAATTTTC ATGATTTGAG AAGCCTTCGC TTCGAAGCGA 1440 
AAAGTCCTAA TAGTAGAAGA ACCCTCCATA AACCTGGAGT GACTATATGG ATGCCCCCCA 1500 
CCCTACCACA CATTCGAAGA ACCCGTATAC ATAAAATCT 1539

SEQ ID NO: 4 
SEQUENCE LENGTH: 1807
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear 
MOLECULAR TYPE: cDNA to mRNA 
SEQUENCE DESCRIPTION: 

GAATTCTTTC TTCAGCCCAT GTAAACATGA AAATAAGGGT TAAAAATGAC TTCATTATGG 60 
GGAAAAGGGA CAGGATGCAA ATTGTTCAAA TTCCGGGTGG CCGCTGCTCC GGCCTCCGGG 120 
GCCTTGCGGA GACTCACCCC TTCAGCGTCG CTGCCCCCAG CTCAGCTCTT ACTGCGGGCC 180 
GTCCGACGGC GGTCCCATCC TGTCAGGGAC TATGCGGCGC AAACATCTCC TTCGCCAAAA 240 
GCAGGCGCCG CCACCGGGCG CATCGTGGCG GTCATTGGCG CAGTGGTGGA CGTCCAGTTT 300 
GATGAGGGAC TACCACCAAT TCTAAATGCC CTGGAAGTGC AAGGCAGGGA GACCAGACTG 360 
GTTTTGGAGG TGGCCCAGCA TTTGGGTGAG AGCACAGTAA GGACTATTGC TATGGATGGT 420 
ACAGAAGGCT TGGTTAGAGG CCAGAAAGTA CTGGATTCTG GTGCACCAAT CAAAATTCCT 480 
GTTGGTCCTG AGACTTTGGG CAGAATCATG AATGTCATTG GAGAACCTAT TGATGAAAGA 540 
GGTCCCATCA AAACCAAACA ATTTGCTCCC ATTCATGCTG AGGCTCCAGA GTTCATGGAA 600 
ATGAGTGTTG AGCAGGAAAT TCTGGTGACT GGTATCAAGG TTGTCGATCT GCTAGCTCCC 660
TATGCCAAGG GTGGCAAAAT TGGGCTTTTT GGTGGTGCTG GAGTTGGCAA GACTGTACTG 720 
ATCATGGAGT TAATCAACAA TGTCGCCAAA GCCCATGGTG GTTACTCTGT GTTTGCTGGT 780 
GTTGGTGAGA GGACCCGTGA AGGCAATGAT TTATACCATG AAATGATTGA ATCTGGTGTT 840 
ATCAACTTAA AAGATGCCAC CTCTAAGGTA GCGCTGGTAT ATGGTCAAAT GAATCAACCA 900 
CCTGGTGCTC GTGCCCGGGT AGCTCTGACT GGGCTGACTG TGGCTGAATA CTTCAGAGAC 960 
CAAGAAGGTC AAGATGTACT GCTATTTATT GATAACATCT TTCGCTTCAC CCAGGCTGGT 1020 
TCAGAGGTGT CTGCATTATT GGGCCGAATC CCTTCTGCTG TGGGCTATCA GCCTACCCTG 1080 
GCCACTGACA TGGGCACTAT GCAGGAAAGA ATTACCACTA CCAAGAAGGG ATCTATCACC 1140 
TCTGTACAGG CTATCTATGT GCCTGCTGAT GACTTGACTG ACCCTGCCCC TGCTACTACG 1200 
TTTGCCCATT TGGATGCTAC CACTGTACTG TCGCGTGCCA TTGCTGAGCT GGGCATCTAT 1260 
CCAGCTGTGG ATCCTCTAGA CTCCACCTCT CGTATCATGG ATCCCAACAT TGTTGGCAGT 1320 
GAGCATTACG ATGTTGCCCG TGGGGTGCAA AAGATCCTGC AGGACTACAA ATCCCTCCAG 1380 
GATATCATTG CCATCCTGGG TATGGATGAA CTTTCTGAGG AAGACAAGTT GACCGTGTCC 1440 
CGTGCACGGA AAATACAGCG TTTCTTGTCT CAGCCATTCC AGGTTGCTGA GGTCTTCACA 1500 
GGTCATATGG GGAAGCTGGT ACCCCTGAAG GAGACCATCA AAGGATTCCA GCAGATTTTG 1560 
GCAGGTGAAT ATGACCATCT CCCAGAACAG GCCTTCTATA TGGTGGGACC CATTGAAGAA 1620 
GCTGTGGCAA AAGCTGATAA GCTGGCTGAA GAGCATTCAT CGTGAGGGGT CTTTGTCCTC 1680 
TGTACTTGTC TCTCTCCTTG CCCCTAACCC AAAAAGCTTC ATTTTTCTAT ATAGGCTGCA 1740
CAAGAGCCTT GATTGAAGAT ATATTCTTTC TGAACAGTAT TTAAGGTTTC CAATAAAATC 1800
GGAATTC 1807 



SEQ ID NO: 5 

SEQUENCE LENGTH: 4992 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULAR TYPE: cDNA to mRNA 

SEQUENCE DESCRIPTION: 

CCGCGGTGAG CCGCGAGGAA GAGAGGCGAG CGAGAGTGGA GGAGGAGGCG GCGGCTGCGG 60 
GACGGTCCCC AGGAATGTCG CTGCCCCCCC CCCCCCTGCC GTTGAGGAGG AGACGGAGGA 120 
GACCGACGTT GTTAGGGAAG ATGATCCCTA TGATCTGCCG CTGTTTCTGC ACAGAAATGA 180 
GGGAAATACA AAGAACCAAA TACAGTTCTA AATTTGGGAT CTGTATTTTG AGATGATTTT 240
ATTTTCAGAA TGAGAAGCAT ATCTGGTTAC CTTTATGAAT GTAGAGACAT GAGAAGAGAG 300 
TTATGATGGC AAAAAACAAA GAGCCTCGTC CCCCATCCTA TACCATCAGT ATAGTTGGAC 360 
TCTCTGGGAC TGAAAAAGAC AAAGGTAACT GTGGAGTTGG AAAGTCTTGT TTGTGCAATA 420 
GATTTGTACG CTCAAAAGCA GATGAATATT ATCCAGAGCA TACTTCTGTG CTTAGCACCA 480 
TTGACTTTGG AGGACGAGTA GTAAACAATG ATCACTTTTT GTACTGGGGT GACATAATAC 540 
AAAATAGTGA AGATGGAGTA GAATGCAAAA TTCATGTCAT TGAACAAACA GAGTTCATTG 600 
ATGACCAGAC TTTCTTGCCT CATCGGAGTA CGAATTTGCA ACCATATATA AAACGTGCAG 660 
CTGCATCTAA ATTGCAGTCA GCAGAAAAAC TAATGTACAT TTGCACTGAT CAGCTAGGCT 720 
TAGAACAAGA CTTTGAACAG AAGCAAATGC CTGAAGGGAA GCTCAACGTA GATGGATTTT 780 
TATTATGCAT TGATGTAAGT CAAGGATGCA ATAGGAAGTT TGATGATCAA CTTAAATTTG 840 
TGAATAACCT TTTTGTCCAG TTATCAAAAT CAAAAAAACC TGTAATAATA GCAGCAACTA 900
AATGTGATGA ATGCGTGGGT CATTATCTTA GAGAAGTTCA GGCATTTGCT TCAAATAAAA 960 
AGAACCTTCT TGTAGTGGAA ACACTCAGCG CAATAAAAGT CAACATTGAA ACATGTTTTA 1020 
CTGCACTGGT ACAAATGTTG GATAAAACTC GTAGCAAGCC TAAAATTATT CCCTATTTGG 1080 
ATGCTTATAA AACACAGAGA CAACTTGTTG TCACAGCAAC AGATAAGTTT GAAAAACTTG 1140 
TGCAGACTGT GAGAGATTAT CATGCAACTT GGAAAACTGT TAGTAATAAA TTAAAAAATC 1200 
ATCCTGATTA TGAAGAATAC ATCAACTTAG AGGGAACAAG AAAGGCCAGA AATACATTCT 1260 
CAAAACATAT AGAACAACTT AAACAGGAAC ATATAAGAAA AAGGAGAGAA GAGTATATAA 1320 
ATACTTTACC AAGAGCTTTT AACACTCTTT TGCCAAATCT AGAAGAGATT GAACATTTGA 1380 
ATTGGTCAGA AGCTTTGAAG TTAATGGAAA AGAGAGCAGA TTTCCAGTTA TGTTTTGTGG 1440
TGCTAGAAAA AACTCCTTGG GATGAAACTG ACCATATAGA CAAAATTAAT GATAGGCGGA 1500
TTCCATTTGA CCTCCTGAGC ACTTTAGAAG CTGAAAAAGT CTATCAGAAC CATGTACAGC 1560 
ATCTGATATC CGAGAAGAGG AGGGTGGAAA TGAAGGAAAA ATTCAAAAAG ACTTTGGAAA 1620 
AAATTCAATT CATTTCACCA GGGCAGCCAT GGGAGGAAGT TATGTGCTTT GTTATGGAGG 1680 
ATGAAGCCTA CAAATATATC ACTGAGGCTG ATAGCAAAGA GGTATATGGT AGGCATCAGC 1740 
GAGAAATAGT TGAAAAAGCC AAAGAAGAGT TTCAAGAAAT GCTTTTTGAG CATTCTGAAC 1800 
TTTTTTATGA TTTAGATCTT AATGCAACAC CTAGTTCAGA TAAAATGAGT GAAATTCATA 1860 
CAGTTCTGAG TGAAGAACCT AGATATAAAG CTTTACAGAA ACTTGCACCT GATAGGGAAT 1920 
CCCTTCTACT TAAGCATATA GGATTTGTTT ATCATCCCAC TAAAGAAACA TGTCTTAGTG 1980 
GCCAAAATTG TACAGACATT AAAGTGGAGC AGTTACTTGC TAGTAGTCTT TTACAGTTGG 2040 
ATCATGGCCG CTTAAGATTA TATCACGATA GTACCAATAT AGATAAAGTT AACCTTTTTA 2100 
TTTTAGGGAA GGATGGCCTT GCCCAAGAAC TAGCAAATGA GATAAGGACA CAATCCACTG 2160 
ATGATGAGTA TGCCTTAGAT GGAAAAATTT ATGAACTTGA TCTTCGGCCG GTTGATGCCA 2220 
AATCGCCTTA CTTTTTGAGT CAGTTATGGA CTGCCGCCTT TAAACCACAT GGGTGCTTCT 2280 
GTGTATTTAA TTCCATTGAG TCATTGAGTT TTATTGGGGA ATTTATTGGG AAAATAAGAA 2340 
CTGAAGCTTC TCAGATCAGA AAAGATAAAT ACATGGCTAA TCTTCCATTT ACATTAATTC 2400 
TGGCTAATCA GAGAGATTCC ATTAGTAAGA ATCTACCAAT TCTCAGGCAC CAAGGGCAGC 2460 
AGTTGGCAAA CAAGTTGCAA TGTCCTTTTG TAGATGTACC TGCTGGTACA TATCCTCGTA 2520 
AATTTAATGA AACCCAAATA AAGCAAGCTC TCAGAGGAGT ATTGGAATCA GTTAAACACA 2580 
ATTTGGATGT GGTGAGCCCA ATTCCJGCCA ATAAGGACTT ATCAGAAGCT GACTTGAGAA 2640 
TTGTCATGTG CGCCATGTGT GGAGATCCAT TTAGTGTGGA TCTTATTCTT TCACCCTTCC 2700
TTGATTCTCA TTCTTGCAGT GCTGCTCAAG CTGGACAGAA TAATTCCCTA ATGCTTGATA 2760 
AAATCATTGG TGAAAAAAGG AGGCGAATAC AGATCACAAT ATTATCATAC CACTCTTCAA 2820 
TTGGAGTAAG AAAAGATGAA CTAGTTCATG GGTATATATT AGTTTACTCT GCAAAACGGA 2880 
AAGCTTCGAT GGGAATGCTT CGAGCATTTC TATCAGAAGT TCAAGACACC ATTCCTGTAC 2940 
AGCTGGTGGC AGTTACTGAC AGCCAAGCAG ATTTTTTTGA AAATGAGGCT ATCAAAGAGT 3000 
TAATGACTGA AGGAGAACAC ATTGCAACTG AGATCACTGC TAAATTTACA GCACTGTATT 3060 
CTTTATCTCA GTATCATCGG CAAACTGAGG TCTTTACTCT GTTTTTTAGT GATGTTCTAG 3120 
AGAAAAAAAA TATGATAGAA AATTCTTATT TGTCTGATAA TACAAGGGAA TCAACCCATC 3180 
AAAGTGAAGA TGTTTTTCTA CCATCTCCCA GAGACTGTTT TCCCTATAAT AACTACCCTG 3240 
ATTCAGATGA TGACACAGAA GCACCACCTC CTTATAGTCC AATTGGGGAT GATGTACAGT 3300 
TGCTTCCAAC ACCTAGTGAC CGTTCCAGAT ATAGATTAGA TTTGGAAGGA AATGAGTATC 3360 
CTATTCATAG TACCCCAAAC TGTCATGACC ATGAACGCAA CCATAAAGTG CCTCCACCTA 3420 
TTAAACCTAA ACCAGTTGTA CCTAAGACAA ATGTGAAAGC GCTCGTTCCA AACCTTTTAA 3480
GGGCAATTGA AGCTGGTATT GGTAAAAATC CAAGAAAGCA GACTTCCCGG GTGCCTTTCG 3540
GTCCTGAAGA TATGGATCCT TCAGATAACT ATGCGGAACC CATTGATACA ATTTTCAAAC 3600 
AGAAGGGCTA TTCTGATGAG ATTTATGTTG TCCCAGATGA TAGTCAAAAT CGTATTAAAA 3660 
TTCGAAACTC ATTTGTAAAT AACACCCAAG GAGATGAAGA AAATGGGTTT TCTGATAGAC 3720 
CTCAAAAAGT CATGGGGAAC GGAGGCCTTC AAAATACAAA TATAAATCTA AAACCTTGTT 3780 
TAGTAAAGCC AAGTCATACT ATAGAAGAAC ACATTCAGAT GCCAGTGATG ATGAGGCTTT 3840 
CACCACTTCT AAAACCAAAA AGAAAAGGAA GACATCGTGG AAGTGAAGAA GATCCACTTC 3900 
TTTCTCCTGT TGAAACTTGG AAAGGTGGTA TTGATAATCC TGCAATCACT TCTGACCAGG 3960 
AGTTAGATGA TAAGAAGATG AAGAAGAAAA CCCACAAAGT GAAAGAAGAT AAAAAAAAGA 4020 
AAACTAAGAA CTTCAATCCA CCAACACGTA GAAATTGGGA AAGTAATTAC TTTGGGATGC 4080 
CCCTCCAGGA TCTGGTTACA GCTGAGAAGC CCATACCACT ATTTGTTGAG AAATGTGTGG 4140 
AATTTATTGA AGATACAGGG TTATGTACCG AGAGACTCTA CCGTGTCAGC GGGAATAAAA 4200 
CTGACCAAGA AAATATTCAA AAGCAGTTTG TTCAAGATCA TAATATCAAT CTAGTGTCAA 4260 
TGGAAGTAAC AGTAAATGCT GTAGCTGGAG CCCTTAAAGC TTTCTTTGCA GATCTGCCAG 4320 
ATCCTTTAAT TCCATATTCT CTTCATCCAG AACTATTGGA AGCAGCAAAA ATCCCGGATA 4380 
AAACAGAACG TCTTCATGCC TTGAAAGAAA TTGTTAAGAA ATTTCATCCT GTAAACTATG 4440 
ATGTATTCAG ATACGTGATA ACACATCTAA ACAGGGTTAG TCAGCAACAT AAAATCAACC 4500 
TAATGACAGC AGACAACTTA TCCATCTGTT TTGGCCAACC CTTGATGAGA CCTGATTTGA 4560 
AATCGATGGA GTTTCTGTCT ACTACTAAGA TTCATCAATC TGTTGTTGAA ACATTCATTC 4620 
AGCAGTGTCA GTTTTTCTTT TACAATGGAG AAATTGTAGA AACGACAAAC ATTGTGGCTC 4680 
CTCCACCACC TTCAAACCCA GGACAGTTGG TGGAACCAAT GGTGCCACTT CAGTTGCCGC 4740 
CACCATTGCA ACCTCAGCTG ATACAACCAC AATTACAAAC GGATCCTCTT GGTATTATAT 4800 
GAGTAGGAAG TGATTGCAAA CAGGCTGGAT TTGGACAAAA AGCAAATCTA GACATGCATG 4860 
TTTCAGGGTT CAGTAGTATA CTTCATGTTT CATACAGATA ATTCACATTC AAAATTACAT 4920 
TTTCTCTTTG AACTAGATGG TATTCCTTAT TCACTTACAT TACAAATCTA AGACCATGTG 4980 
ATAAGCATGA CT 4992

SEQ ID NO: 6 

SEQUENCE LENGTH: 708 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double

TOPOLOGY: linear 

MOLECULAR TYPE: cDNA to mRNA 

SEQUENCE DESCRIPTION:

TATGGCACAT GCAGCGCAAG TAGGTCTACA AGACGCTACT TCCCCTATCA TAGAAGAGCT 60

TATCACCTTT CATGATCACG CCCTCATAAT CATTTTCCTT ATCTGCTTCC TAGTCCTGTA 120 

TGCCCTTTTC CTAACACTCA CAACAAAACT AACTAATACT AACATCTCAG ACGCTCAGGA 180 

AATAGAAACC GTCTGAACTA TCCTGCCCGC CATCATCCTA GTCCTCATCG CCCTCCCATC 240 

CCTACGCATC CTTTACATAA CAGACGAGGT CAACGATCCC TCCCTTACCA TCAAATCAAT 300 

TGGCCACCAA TGGTACTGAA CCTACGAGTA CACCGACTAC GGCGGACTAA TCTTCAACTC 360 

CTACATACTT CCCCCATTAT TCCTAGAACC AGGCGACCTG CGACTCCTTG ACGTTGACAA 420 

TCGAGTAGTA CTCCCGATTG AAGCCCCCAT TCGTATAATA ATTACATCAC AAGACGTCTT 480 

GCACTCATGA GCTGTCCCCA CATTAGGCTT AAAAACAGAT GCAATTCCCG GACGTCTAAA 540 

CCAAACCACT TTCACCGCTA CACGACCGGG GGTATACTAC GGTCAATGCT CTGAAATCTG 600 

TGGAGCAAAC CACAGTTTCA TGCCCATCGT CCTAGAATTA ATTCCCCTAA AAATCTTTGA 660 

AATAGGGCCC GTATTTACCC TATAGCACCC CCTCTACCCC CTCTAGAG 708 

SEQ ID NO: 7 
SEQUENCE LENGTH: 1140 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULAR TYPE: cDNA to mRNA
SEQUENCE DESCRIPTION:

ATGACCCCAA TACGCAAAAT TAACCCCCTA ATAAAATTAA TTAACCACTC ATTCATCGAC 60
CTCCCCACCC CATCCAACAT CTCCGCATGA TGAAACTTCG GCTCACTCCT TGGCGCCTGC 120
CTGATCCTCC AAATCACCAC AGGACTATTC CTAGCCATGC ACTACTCACC AGACGCCTCA 180
ACCGCCTTTT CATCAATCGC CCACATCACT CGAGACGTAA ATTATGGCTG AATCATCCGC 240
TACCTTCACG CCAATGGCGC CTCAATATTC TTTATCTGCC TCTTCCTACA CATCGGGCGA 300
GGCCTATATT ACGGATCATT TCTCTACTCA GAAACCTGAA ACATCGGCAT TATCCTCCTG 360
CTTGCAACTA TAGCAACAGC CTTCATAGGT TATGTCCTCC CGTGAGGCCA AATATCATTC 420
TGAGGGGCCA CAGTAATTAC AAACTTACTA TCCGCCATCC CATACATTGG GACAGACCTA 480
GTTCAATGAA TCTGAGGAGG CTACTCAGTA GACAGTCCCA CCCTCACACG ATTCTTTACC 540
TTTCACTTCA TCTTGCCCTT CATTATTGCA ACCCTAGCAG CACTCCACCT CCTATTCTTG 600
CACGAAACGG GATCAAACAA CCCCCTAGGA ATCACCTCCC ATTCCGATAA AATCACCTTC 660
CACCCTTACT ACACAATCAA AGACACCCTC GGCTTACTTC TCTTCCTTCT CTCCTTAATG 720
ACATTAACAC TATTCTCACC AGACCTCCTA GGCGACCCAG ACAATTATAC CCTAGCCAAC 780
CCCTTAAACA CCCCTCCCCA CATCAAGCCC GAATGATATT TCCTATTCGC CTACACAATT 840
CTCCGATCCG TCCCTAACAA ACTAGGAGGC GTCCTTGCCC TATTACTATC CATCCTCATC 900
CTAGCAATAA TCCCCATCCT CCATATATCC AAACAACAAA GCATAATATT TCGCCCACTA 960 
AGCCAATCAC TTTATTGACT CCTAGCCGCA GACCTCCTCA TTCTAACCTG AATCGGAGGA 1020 
CAACCAGTAA GCTACCCTTT TACCATCATT GGACAAGTAG CATCCGTACT ATACTTCACA 1080 
ACAATCCTAA TCCTAATACC AACTATCTCC CTAATTGAAA ACAAAATACT CAAATGGGCC 1140 

SEQ ID NO: 8 
SEQUENCE LENGTH: 5629 
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULAR TYPE: cDNA to mRNA 
SEQUENCE DESCRIPTION: 

GCGCGACCGT CCCGGGGGTG GGGCCGGGCG CAGCGGCGAG AGGAGGCGAA GGTGGCTGCG 60 
GTAGCAGCAG CGCGGCAGCC TCGGACCCAG CCCGGAGCGC AGGGCGGCCG CTGCAGGTCC 120 
CCGCTCCCCT CCCCGTGCGT CCGCCCATGG CCGCCGCCGG GCAGCTGTGC TTGCTCTACC 180 
TGTCGGCGGG GCTCCTGTCC CGGCTCGGCG CAGCCTTCAA CTTGGACACT CGGGAGGACA 240 
ACGTGATCCG GAAATATGGA GACCCCGGGA GCCTCTTCGG CTTCTCGCTG GCCATGCACT 300 
GGCAACTGCA GCCCGAGGAC AAGCGGCTGT TGCTCGTGGG GGCCCCGCGC GGAGAAGCGC 360 
TTCCACTGCA GAGAGCCAAC AGAACGGGAG GGCTGTACAG CTGCGACATC ACCGCCCGGG 420 
GGCCATGCAC GCGGATCGAG TTTGATAACG ATGCTGACCC CACGTCAGAA AGCAAGGAAG 480 
ATCAGTGGAT GGGGGTCACC GTCCAGAGCC AAGGTCCAGG GGGCAAGGTC GTGACATGTG 540 
CTCACCGATA TGAAAAAAGG CAGCATGTTA ATACGAAGCA GGAATCCCGA GACATCTTTG 600 
GGCGGTGTTA TGTCCTGAGT CAGAATCTCA GGATTGAAGA CGATATGGAT GGGGGAGATT 660 
GGAGCTTTTG TGATGGGCGA TTGAGAGGCC ATGAGAAATT TGGCTCTTGC CAGCAAGGTG 720 
TAGCAGCTAC TTTTACTAAA GACTTTCATT ACATTGTATT TGGAGCCCCG GGTACTTATA 780 
ACTGGAAAGG GATTGTTCGT GTAGAGCAAA AGAATAACAC TTTTTTTGAC ATGAACATCT 840 
TTGAAGATGG GCCTTATGAA GTTGGTGGAG AGACTGAGCA TGATGAAAGT CTCGTTCCTG 900 
TTCCTGCTAA CAGTTACTTA GGTTTTTCTT TGGACTCAGG GAAAGGTATT GTTTCTAAAG 960 
ATGAGATCAC TTTTGTATCT GGTGCTCCCA GAGCCAATCA CAGTGGAGCC GTGGTTTTGC 1020 
TGAAGAGAGA CATGAAGTCT GCACATCTCC TCCCTGAGCA CATATTCGAT GGAGAAGGTC 1080 
TGGCCTCTTC ATTTGGCTAT GATGTGGCGG TGGTGGACCT CAACAAGGAT GGGTGGCAAG 1140 
ATATAGTTAT TGGAGCCCCA CAGTATTTTG ATAGAGATGG AGAAGTTGGA GGTGCAGTGT 1200 
ATGTCTACAT GAACCAGCAA GGCAGATGGA ATAATGTGAA GCCAATTCGT CTTAATGGAA 1260
CCAAAGATTC TATGTTTGGC ATTGCAGTAA AAAATATTGG AGATATTAAT CAAGATGGCT 1320
ACCCAGATAT TGCAGTTGGA GCTCCGTATG ATGACTTGGG AAAGGTTTTT ATCTATCATG 1380 
GATCTGCAAA TGGAATAAAT ACCAAACCAA CACAGGTTCT CAAGGGTATA TCACCTTATT 1440
TTGGATATTC AATTGCTGGA AACATGGACC TTGATCGAAA TTCCTACCCT GATGTTGCTG 1500 
TTGGTTCCCT CTCAGATTCA GTAACTATTT TCAGATCCCG GCCTGTGATT AATATTCAGA 1560 
AAACCATCAC AGTAACTCCT AACAGAATTG ACCTCCGCCA GAAAACAGCG TGTGGGGCGC 1620 
CTAGTGGGAT ATGCCTCCAG GTTAAATCCT GTTTTGAATA TACTGCTAAC CCCGCTGGTT 1680 
ATAATCCTTC AATATCAATT GTGGGCACAC TTGAAGCTGA AAAAGAAAGA AGAAAATCTG 1740 
GGCTATCCTC AAGAGTTCAG TTTCGAAACC AAGGTTCTGA GCCCAAATAT ACTCAAGAAC 1800 
TAACTCTGAA GAGGCAGAAA CAGAAAGTGT GCATGGAGGA AACCCTGTGG CTACAGGATA 1860 
ATATCAGAGA TAAACTGCGT CCCATTCCCA TAACTGCCTC AGTGGAGATC CAAGAGCCAA 1920 
GCTCTCGTAG GCGAGTGAAT TCACTTCCAG AAGTTCTTCC AATTCTGAAT TCAGATGAAC 1980 
CCAAGACAGC TCATATTGAT GTTCACTTCT TAAAAGAGGG ATGTGGAGAC GACAATGTAT 2040 
GTAACAGCAA CCTTAAACTA GAATATAAAT TTTGCACCCG AGAAGGAAAT CAAGACAAAT 2100 
TTTCTTATTT ACCAATTCAA AAAGGTGTAC CAGAACTAGT TCTAAAAGAT CAGAAGGATA 2160 
TTGCTTTAGA AATAACAGTG ACAAACAGCC CTTCCAACCC AAGGAATCCC ACAAAAGATG 2220 
GCGATGACGC CCATGAGGCT AAACTGATTG CAACGTTTCC AGACACTTTA ACCTATTCTG 2280 
CATATAGAGA ACTGAGGGCT TTCCCTGAGA AACAGTTGAG TTGTGTTGCC AACCAGAATG 2340 
GCTCGCAAGC TGACTGTGAG CTCGGAAATC CTTTTAAAAG AAATTCAAAT GTCACTTTTT 2400 
ATTTGGTTTT AAGTACAACT GAAGTCACCT TTGACACCCC ATATCTGGAT ATTAATCTGA 2460 
AGTTAGAAAC AACAAGCAAT CAAGATAATT TGGCTCCAAT TACAGCTAAA GCAAAAGTGG 2520 
TTATTGAACT GCTTTTATCG GTCTCGGGAG TTGCTAAACC TTCCCAGGTG TATTTTGGAG 2580 
GTACAGTTGT TGGCGAGCAA GCTATGAAAT CTGAAGATGA AGTGGGAAGT TTAATAGAGT 2640 
ATGAATTCAG GGTAATAAAC TTAGGTAAAC CTCTTACAAA CCTCGGCACA GCAACCTTGA 2700 
ACATTCAGTG GCCAAAAGAA ATTAGCAATG GGAAATGGTT GCTTTATTTG GTGAAAGTAG 2760 
AATCCAAAGG ATTGGAAAAG GTAACTTGTG AGCCACAAAA GGAGATAAAC TCCCTGAACC 2820 
TAACGGAGTC TCACAACTCA AGAAAGAAAC GGGAAATTAC TGAAAAACAG ATAGATGATA 2880
ACAGAAAATT TTCTTTATTT GCTGAAAGAA AATACCAGAC TCTTAACTGT AGCGTGAACG 2940 
TGAACTGTGT GAACATCAGA TGCCCGCTGC GGGGGCTGGA CAGCAAGGCG TCTCTTATTT 3000 
TGCGCTCGAG GTTATGGAAC AGCACATTTC TAGAGGAATA TTCCAAACTG AACTACTTGG 3060 
ACATTCTCAT GCGAGCCTTC ATTGATGTGA CTGCTGCTGC CGAAAATATC AGGCTGCCAA 3120 
ATGCAGGCAC TCAGGTTCGA GTGACTGTGT TTCCCTCAAA GACTGTAGCT CAGTATTCGG 3180 
GAGTACCTTG GTGGATCATC CTAGTGGCTA TTCTCGCTGG GATCTTGATG CTTGCTTTAT 3240 
TAGTGTTTAT ACTATGGAAG TGTGGTTTCT TCAAGAGAAA TAAGAAAGAT CATTATGATG 3300
CCACATATCA CAAGGCTGAG ATCCATGCTC AGCCATCTGA TAAAGAGAGG CTTACTTCTG 3360
ATGCATAGTA TTGATCTACT TCTGTAATTG TGTGGATTCT TTAAACGCTC TAGGTACGAT 3420 
GACAGTGTTC CCCGATACCA TGCTGTAAGG ATCCGGAAAG AAGAGCGAGA GATCAAAGAT 3480 
GAAAAGTATA TTGATAACCT TGAAAAAAAA CAGTGGATCA CAAAGTGGAA CAGAAATGAA 3540 
AGCTACTCAT AGCGGGGGCC TAAAAAAAAA AAAGCTTCAC AGTACCCAAA CTGCTTTTTC 3600 
CAACTCAGAA ATTCAATTTG GATTTAAAAG CCTGCTCAAT CCCTGAGGAC TGATTTCAGA 3660 
GTGACTACAC ACAGTACGAA CCTACAGTTT TAACTGTGGA TATTGTTACG TAGCCTAAGG 3720 
CTCCTGTTTT GCACAGCCAA ATTTAAAACT GTTGGAATGG ATTTTTCTTT AACTGCCGTA 3780 
ATTTAACTTT CTGGGTTGCC TTTGTTTTTG GCGTGGCTGA CTTACATCAT GTGTTGGGGA 3840 
AGGGCCTGCC CAGTTGCACT CAGGTGACAT CCTCCAGATA GTGTAGCTGA GGAGGCACCT 3900 
ACACTCACCT GCACTAACAG AGTGGCCGTC CTAACCTCGG GCCTGCTGCG CAGACGTCCA 3960 
TCACGTTAGC TGTCCCACAT CACAAGACTA TGCCATTGGG GTAGTTGTGT TTCAACGGAA 4020 
AGTGCTGTCT TAAACTAAAT GTGCAATAGA AGGTGATGTT GCCATCCTAC CGTCTTTTCC 4080 
TGTTTCCTAG CTGTGTGAAT ACCTGCTCAC GTCAAATGCA TACAAGTTTC ATTCTCCCTT 4140 
TCACTAAAAA CACACAGGTG CAACAGACTT GAATGCTAGT TATACTTATT TGTATATGGT 4200 
ATTTATTTTT TCTTTTCTTT ACAAACCATT TTGTTATTGA CTAACAGGCC AAAGAGTCTC 4260 
CAGTTTACCC TTCAGGTTGG TTTAATCAAT CAGAATTAGA ATTAGAGCAT GGGAGGGTCA 4320 
TCACTATGAC CTAAATTATT TACTGCAAAA AGAAAATCTT TATAAATGTA CCAGAGAGAG 4380 
TTGTTTTAAT AACTTATCTA TAAACTATAA CCTCTCCTTC ATGACAGCCT CCACCCCACA 4440 
ACCCAAAAGG TTTAAGAAAT AGAATTATAA CTGTAAAGAT GTTTATTTCA GGCATTGGAT 4500 
ATTTTTTACT TTAGAAGCCT GCATAATGTT TCTGGATTTA CATACTGTAA CATTCAGGAA 4560
TTCTTGGAGA AGATGGGTTT ATTCACTGAA CTCTAGTGCG GTTTACTCAC TGCTGCAAAT 4620 
ACTGTATATT CAGGACTTGA AAGAAATGGT GAATGCCTAT GGAACTAGTG GATCCAAACT 4680 
GATCCAGTAT AAGACTACTG AATCTGCTAC CAAAACAGTT AATCAGTGAG TCGAGTGTTC 4740 
TATTTTTTGT TTTGTTTCCT CCCCTATCTG TATTCCCAAA AATTACTTTG GGGCTAATTT 4800 
AACAAGAACT TTAAATTGTG TTTTAATTGT AAAAATGGCA GGGGGTGGAA TTATTACTCT 4860 
ATACATTCAA CAGAGACTGA ATAGATATGA AAGCTGATTT TTTTTAATTA CCATGCTTCA 4920 
CAATGTTAAG TTATATGGGG AGCAACAGCA AACAGGTGCT AATTTGTTTT GGATATAGTA 4980 
TAAGCAGTGT CTGTGTTTTG AAAGAATAGA ACACAGTTTG TAGTGCCACT GTTGTTTTGG 5040 
GGGGGGCTTT TTTTCTTTTT CCGGAAAATC CTTAAACCTT AAGATACTAA GGACGTTGTT 5100 
TTGGTTGTAC TTGGAATTCT TAGTCACAAA ATATATTTTG TTTACAAAAA TTTCTGTAAA 5160 
ACAGGTTATA ACAGTGTTTA AAGTCTCAGT TTCTTGCTTG GGGAACTTGT GTCCCTAATG 5220 
TGTTAGATTG CTAGATTGCT AAGGAGCTGA TACTTGACAG TTTTTTAGAC CTGTGTTACT 5280 
AAAAAAAAGA TGAATGTCGG AAAAGGGTGT TGGGAGGGTG GTCAACAAAG AAACAAAGAT 5340
GTTATGGTGT TTAGACTTAT GGTTGTTAAA AATGTCATCT CAAGTCAAGT CACTGGTCTG 5400
TTTGCATTTG ATACATTTTT GTACTAACTA GCATTGTAAA ATTATTTCAT GATTAGAAAT 5460
TACCTGTGGA TATTTGTATA AAAGTGTGAA ATAAATTTTT TATAAAAGTG TTCATTGTTT 5520 
CGTAACACAG CATTGTATAT GTGAAGCAAA CTCTAAAATT ATAAATGACA ACCTGAATTA 5580 
TCTATTTCAT CAAAAAAAAA AAAAAAAAAA ACTTTATGGG CACAACTGG 5629 

SEQ ID NO: 9 
SEQUENCE LENGTH: 580 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULAR TYPE: cDNA to mRNA 
SEQUENCE DESCRIPTION: 

CCATCCAATG AGGCCACCTC TTTCTAAACT CAGACTCTTC ATTTAGGGAG GTGAGTTCCA 60 
TTAAGGAACT TGAGATTTTC AGATAAATGG AAAATACTAG ATAAAGAGGT ATCTCATAGA 120 
TAGCAAAGGT AAACTCTCAT ACAATCATTG AGCTAGGACA TTAATGGTTC AGTGGTTCCC 180 
AATTCTAGAT ATACATTAAA ATAAATTGAA AAGCCTTTTA AAAATACATG ATTACTGGAC 240
CTACTGAATT ATATCCTTTG GGGAGCCCAA GAACTTATTA AATTCTCTGG GCTATTTTTA 300
TGATTTCTCT GAGCTGTTAC TGGGAACTAC TGATTGAATC CATYTTTTAT AGTAATGTTT 360 
CCAACAGAAG GCTGTTTSCC TTTGCTTAAC ATTATTTCCA GTGAAGTATT ATTTTCCATT 420 
CTGGAGACAG TTCAAAAGTT TTTTTAAGTA ACAGCTTTAT TGAGACAATT TATATSCCGT 480 
ACAATTCACC TAAAGTGTGT AATTCAGTTG TTTTTAGTAT GTTCACAGAA TTGTGCAGCT 540 
TGCATCTATC ACCACAAATT TAGAACCTTG TCATAATCCC 580 

SEQ ID NO: 10 

SEQUENCE LENGTH: 1552 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULAR TYPE: cDNA to mRNA 

SEQUENCE DESCRIPTION: 

CCCAAACCCA CTCCACCTTA CTACCAGACA ACCTTAGCCA AACCATTTAC CCAAATAAAG 60 
TATAGGCGAT AGAAATTGAA ACCTGGCGCA ATAGATATAG TACCGCAAGG GAAAGATGAA 120 
AAATTATAAC CAAGCATAAT ATAGCAAGGA CTAACCCCTA TACCTTCTGC ATAATGAATT 180
AACTAGAAAT AACTTTGCAA GGAGAGCCAA AGCTAAGACC CCCGAAACCA GACGAGCTAC 240
CTAAGAACAG CTAAAAGAGC ACACCCGTCT ATGTAGCAAA ATAGTGGGAA GATTTATAGG 300 
TAGAGGCGAC AAACCTACCG AGCCTGGTGA TAGCTGGTTG TCCAAGATAG AATCTTAGTT 360 
CAACTTTAAA TTTGCCCACA GAACCCTCTA AATCCCCTTG TAAATTTAAC TGTTAGTCCA 420 
AAGAGGAACA GCTCTTTGGA CACTAGGAAA AAACCTTGTA GAGAGAGTAA AAAATTTAAC 480 
ACCCATAGTA GGCCTAAAAG CAGCCACCAA TTAAGAAAGC GTTCAAGCTC AACACCCACT 540 
ACCTAAAAAA TCCCAAACAT ATAACTGAAC TCCTCACACC CAATTGGACC AATCTATCAC 600 
CCTATAGAAG AACTAATGTT AGTATAAGTA ACATGAAAAC ATTCTCCTCC GCATAAGCCT 660 
GCGTCAGATT AAAACACTGA ACTGACAATT AACAGCCCAA TATCTACAAT CAACCAACAA 720 
GTCATTATTA CCCTCACTGT CAACCCAACA CAGGCATGCT CATAAGGAAA GGTTAAAAAA 780 
AGTAAAAGGA ACTCGGCAAA TCTTACCCCG CCTGTTTACC AAAAACATCA CCTCTAGCAT 840 
CACCAGTATT AGAGGCACCG CCTGCCCAGT GACACATGTT TAACGGCCGC GGTACCCTAA 900 
CCGTGCAAAG GTAGCATAAT CACTTGTTCC TTAATTAGGG ACCCGTATGA ATGGCTCCAC 960 
GAGGGTTCAG CTGTCTCTTA CTTTTAACCA GTGAAATTGA CCTGCCCGTG AAGAGGCGGG 1020 
CATGACACAG CAAGACGAGA AGACCCTATG GAGCTTTAAT TTATTAATGC AAACAGTACC 1080 
TAACAAACCT ACAGGTCCTA AACTACCAAA CCTGCATTAA AAATTTCGGT TGGGGCGACC 1140 
TCGGAGCAGA ACCCAACCTC CGAGCAGTAC ATGCTAAGAC TTCACCAGTC AAAGCGAACT 1200 
ACTATACTCA ATTGATCCAA TAACTTGACC AACGGAACAA GTTACCCTAG GGATAACAGC 1260 
GCAATCCTAT TCTAGAGTCC ATATCAACAA TAGGGTTTAC GACCTCGATG TTGGATCAGG 1320 
ACATCCCGAT GGTGCAGCCG CTATTAAAGG TTCGTTTGTT CAACGATTAA AGTCCTACGT 1380 
GATCTGAGTT CAGACCGGAG TAATCCAGGT CGGTTTCTAT CTACTTCAAA TTCCTCCCTG 1440 
TACGAAAGGA CAAGAGAAAT AAGGCCTACT TCACAAAGCG CCTTCCCCCG TAAATGATAT 1500 
CATCTCAACT TAGTATTATA CCCACACCCA CCCAAGAACA GGGTTTGTTA AG 1552

SEQ ID NO: 11 
SEQUENCE LENGTH: 2116 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: cDNA to mRNA 
SEQUENCE DESCRIPTION: 

GGGTGGCAGA ATATTAGTCT AGCTATCTCC CATTGCTCTC ACGCGCCATC TACTGGATTT 60 
CATCCCAAAC TACAACACGA AAAACTGCTA ATTTTCCTGC CTGCCAGGCC GAGGACTGGA 120 
ATTCAACAGA CTGTTTAGAG CCTTTGCCCT CTGAAAACTT CCAGAAATGA AGCCAACTGA 180
CTATATTCAG TTTACACCAG AGTTAAAGGA ACGCCAACCC TCCCAGATGA GAAAGAATCA 240
GTGCAAGAAC TGTAGCAATT TAAAAAACCA GAGCGTCCCC TTACCTCCAA ATGAGCCCAC 300 
TAGCTCCACA GCAATTGTTC TTAACCAATC TGAAATGATG AGCATGGAAT TCAGAATCTG 360 
AATGGCAATG AAGCTTATAG ATATCCAGGA GAAAGTTGAA ATGCAATCCA AGGAAACCAA 420 
GCAATCCAGT GAAATGGTTT AAGAGCTGAA AGATAAAATA NCAATTTTAC AAAAGACCCA 480 
AACTGAGCTT ATTGAGTTCA AAAAAGAATT TCATAATACA ATCAGAAGTA TTAATAGCAG 540 
AATAGGCCAA GCTGAGGAAA GAATCTCAGA GCTTGACCCC TGGTTCTTTG AATCAACTTA 600 
GACAAAAATA AAGAAAAAAG AGTTTTAAGA AATGAACACA ATCTCCCAGA AATATGAGAT 660 
TATGTWAAGA GACAAAATCT ATGACTCATT GCCATCCCTG AGAGAGAAGG AGAGAGAATA 720 
AGCAACTTGG AAAATATATT TGGGGACATA GCCCACAAAA ATTTCCCTAA TCTCTCTAGA 780
GAGGTTGACA TGTAAATTCA AGAAATACAG AAGACCTTGG CCAGATAATA TACAAGATGA 840 
CCATCCCCAA GGCACATAGT CATCAGATTC ACCATGGTCA ATGCAAAAGA AAAAAATCTT 900 
AAAGACAGCT AGGGAGAAGG GTCAAGTCAC ATGCAGAAGG ACTCTCATTA GGCTGGCAGT 960 
GGACCTCTCA GCAGAAACCT GACAAGCCAG AAGAGATGGA GGGAGAGGGG TCTATTTTTG 1020 
TCATCCTTAA AGAAAAAAAA TTCCAACCAA GAGTCTCATA CACTGCCAAA CTAAGCTTCC 1080 
TAAGTGAAGG AGAAATAAAA ACCTTCTCAG ACAAGCAAAT GCTGAAGGAA TTCAACTAGA 1140 
CCAGCCTAAC AAGAGGTCCT AAGGGAGTGC TGAATATGGA CTCAAAAGAA TAACACCTGC 1200 
TACCACAAAC ACTCACTTAA GCACACAGCC CAACGACACT ATAGGCAATT ACACAGTAAG 1260 
TCTACATAAC AACACAATGA CAGGATCAAC ATCTCACACA TCAATACTAA CCCCGAGTGT 1320 
AAAGGGGCTA AATGCCCCAC TTAAAAGACA TAGAGTGTCA AGCTTGATAA AAAGACAAGA 1380 
TCCAATCATC CACTATTTTC AAGAGCTCTA TGTTATGTGT AATGACACCC ACAGACTCAA 1440 
AGACTTGGAG AAAGATTTAT CATGCAAAAT CAGAAAACAA AAAAGAGCAG GAGTCACTAG 1500 
TTTTATATCA GACAAAACAG ACTTTAAACC CTTAATAATT AAGAAAGACA AAGAAGGGTA 1560 
TTTCCTGGAC CACAGAAGGC TTATTGGAAA AAAGGACATA ATGACAAAGG GTACAATCCA 1620 
ACAAGAAGTT TTAACTATTC TAAATATATA CACACCCAAC ATTGGAGCAC CCAGATTTAT 1680 
AAAACAAGTA CTTCTCGATC TACAAGAAGA CTTAGACAGC CACACAATAA TAGTGGGAGA 1740 
CTTTCACATC CTACTTACAG ATCATTGAGA CAGAAAACTA ATAAAAGAAC TCTGGACTTA 1800 
AACTTGTTAC TTGACCAATT GGACCTAATA GATATCCACA GAAAACTTCA CCCAACAAAG 1860 
ACAGAATATA CATTCTTCTT ATCTGCACAT GGAACACATT CCAAGATCAA TCACATGCTA 1920
GGTAAGAAAG CAAGTCTCAA TAAATTAAAA AAAATTGAAA TCATACGAAC CTTAATATCA 1980 
GACCACAATG TAATTAAAAA TAAATCAATA TCAAGAAGAT CTCATACATA AATACATGAA 2040 
AATTAAACAA CTTACTCCTG AATAACTCTT GTGTGAACAT CAAAATTCAG GAAGAAATAA 2100 
AAAATTATTT GAAATT 2116

SEQ ID NO: 12
SEQUENCE LENGTH: 173 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULAR TYPE: cDNA to mRNA 
SEQUENCE DESCRIPTION: 

GCGATCCACA AATGGGAGGT GACGGTCCAT CAGGGAAGCT GGGTTCGCGG CTCCACGGCT 60 
GGGGGCTGCC GCAATTTCCT GGATACCTTT TGGACCAATC CACAAATAAA AT^TG 120
ACTGAGAAAG ATGAGGGGCA GGAGGAGTGT AGTTTCCTTG TAGCCCTGAT 173

SEQ ID NO: 13 

SEQUENCE LENGTH: 655 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULAR TYPE: cDNA to mRNA 

SEQUENCE DESCRIPTION: 

CTGATCCATG GGCCAGCAGC ATCAATATTA CCTGGGAGCT TACAGAAATG CAGAATTTCA 60 
CCCCCACTGC AGATCTACCG AATCAAAATC TTCCTTTAGC AAAATTTCTC AAACGATTAG 120
CAC™ CATCCATTTT ATCCTTCCTT AGCTATTAGG GATGTGAGGT CCGAGGGCTT 

ACTCCAGCAA TGaT" TTGTTCCTT ° ™ CAT ™- 

ZlZTr T TGAGCTGAAA CTCATTCATC ^CCTTGCTG AGTTTTCTTC TCAATCCTTA 300 

IZZlr, TGAGCCCTAC CTACCCAGTG GTT ^TTTT TGTAGCCAGT 360 

GTGGGACACA GGAGATTGGC AGACCAACAC AGCTAGCCTC TCTCTAGCCC TCCCTCCACC 420 

acItT TAACAATCCA TGTTTGTTCA GT — c a ~ tg ™ 

Icr^TZ Zlr m catttcagaa aaatgtgtac taagttaaaa CCATGTTTAG 540 

cZZT, CTTGTACATT TTCATTT ^T CTTATCAGTA GATTGTCCTT GTTGACATAG 600 
CTCATGCATG AGGACACATA GCAGTACACA CACATTGAAT GAATTGTTAG TCATG 655



SEQ ID NO: 14 
SEQUENCE LENGTH: 2619 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double
TOPOLOGY: linear

MOLECULAR TYPE: cDNA to mRNA 

SEQUENCE DESCRIPTION: 

GACTCCTAGG GGCTTGCAGA CCTAGTGGGA GAGAAAGAAC ATCGCAGCAG CCAGGCAGAA 60 
CCAGGACAGG TGAGGTGCAG GCTGGCTTTC CTCTCGCAGC GCGGTGTGGA GTCCTGTCCT 120 
GCCTCAGGGC TTTTCGGAGC CTGGATCCTC AAGGAACAAG TAGACCTGGC CGCGGGGAGT 180 
GGGGAGGGAA GGGGTGTCTA TTGGGCAACA GGGCGGCAAA GCCCTGAATA AAGGGGCGCA 240 
GGGCAGGCGC AAGTGCAGAG CCTTCGTTTG CCAAGTCGCC TCCAGACCGC AGACATGAAA 300 
CTTGTCTTCC TCGTCCTGCT GTTCCTCGGG GCCCTCGGAC TGTGTCTGGC TGGCCGTAGG 360 
AGAAGGAGTG TTCAGTGGTG CGCCGTATCC CAACCCGAGG CCACAAAATG CTTCCAATGG 420 
CAAAGGAATA TGAGAAAAGT GCGTGGCCCT CCTGTCAGCT GCATAAAGAG AGACTCCCCC 480 
ATCCAGTGTA TCCAGGCCAT TGCGGAAAAC AGGGCCGATG CTGTGACCCT TGATGGTGGT 540 
TTCATATACG AGGCAGGCCT GGCCCCCTAC AAACTGCGAC CTGTAGCGGC GGAAGTCTAC 600 
GGGACCGAAA GACAGCCACG AACTCACTAT TATGCCGTGG CTGTGGTGAA GAAGGGCGGC 660 
AGCTTTCAGC TGAACGAACT GCAAGGTCTG AAGTCCTGCC ACACAGGCCT TCGCAGGACC 720 
GCTGGATGGA ATGTCCCTAC AGGGACACTT CGTCCATTCT TGAATTGGAC GGGTCCACCT 780 
GAGCCCATTG AGGCAGCTGT GGCCAGGTTC TTCTCAGCCA GCTGTGTTCC CGGTGCAGAT 840 
AAAGGACAGT TCCCCAACCT GTGTCGCCTG TGTGCGGGGA CAGGGGAAAA CAAATGTGCC 900 
TTCTCCTCCC AGGAACCGTA CTTCAGCTAC TCTGGTGCCT TCAAGTGTCT GAGAGACGGG 960 
GCTGGAGACG TGGCTTTTAT CAGAGAGAGC ACAGTGTTTG AGGACCTGTC AGACGAGGCT 1020 
GAAAGGGACG AGTATGAGTT ACTCTGCCCA GACAACACTC GGAAGCCAGT GGACAAGTTC 1080 
AAAGACTGCC ATCTGGCCCG GGTCCCTTCT CATGCCGTTG TGGCACGAAG TGTGAATGGC 1140 
AAGGAGGATG CCATCTGGAA TCTTCTCCGC CAGGCACAGG AAAAGTTTGG AAAGGACAAG 1200 
TCACCGAAAT TCCAGCTCTT TGGCTCCCCT AGTGGGCAGA AAGATCTGCT GTTCAAGGAC 1260 
TCTGCCATTG GGTTTTCGAG GGTGCCCCCG AGGATAGATT CTGGGCTGTA CCTTGGCTCC 1320 
GGCTACTTCA CTGCCATCCA GAACTTGAGG AAAAGTGAGG AGGAAGTGGC TGCCCGGCGT 1380 
GCGCGGGTCG TGTGGTGTGC GGTGGGCGAG CAGGAGCTGC GCAAGTGTAA CCAGTGGAGT 1440 
GGCTTGAGCG AAGGCAGCGT GACCTGCTCC TCGGCCTCCA CCACAGAGGA CTGCATCGCC 1500 
CTGGTGCTGA AAGGAGAAGC TGATGCCATG AGTTTGGATG GAGGATATGT GTACACTGCA 1560 
TGCAAATGTG GTTTGGTGCC TGTCCTGGCA GAGAACTACA AATCCCAACA AAGCAGTGAC 1620 
CCTGATCCTA ACTGTGTGGA TAGACCTGTG GAAGGATATC TTGCTGTGGC GGTGGTTAGG 1680 
AGATCAGACA CTAGCCTTAC CTGGAACTCT GTGAAAGGCA AGAAGTCCTG CCACACCGCC 1740 
GTGGACAGGA CTGCAGGCTG GAATATCCCC ATGGGCCTGC TCTTCAACCA GACGGGCTCC 1800 
TGCAAATTTG ATGAATATTT CAGTCAAAGC TGTGCCCCTG GGTCTGACCC GAGATCTAAT 1860
CTCTGTGCTC TGTGTATTGG CGACGAGCAG GGTGAGAATA AGTGCGTGCC CAACAGCAAC 1920
GAGAGATACT ACGGCTACAC TGGGGCTTTC CGGTGCCTGG CTGAGAATGC TGGAGACGTT 1980 
GCATTTGTGA AAGATGTCAC TGTCTTGCAG AACACTGATG GAAATAACAA TGAGGCATGG 2040 
GCTAAGGATT TGAAGCTGGC AGACTTTGCG CTGCTGTGCC TCGATGGCAA ACGGAAGCCT 2100 
GTGACTGAGG CTAGAAGCTG CCATCTTGCC ATGGCCCCGA ATCATGCCGT GGTGTCTCGG 2160 
ATGGATAAGG TGGAACGCCT GAAACAGGTG CTGCTCCACC AACAGGCTAA ATTTGGGAGA 2220 
AATGGATCTG ACTGCCCGGA CAAGTTTTGC TTATTCCAGT CTGAAACCAA AAACCTTCTG 2280 
TTCAATGACA ACACTGAGTG TCTGGCCAGA CTCCATGGCA AAACAACATA TGAAAAATAT 2340 
TTGGGACCAC AGTATGTCGC AGGCATTACT AATCTGAAAA AGTGCTCAAC CTCCCCCCTC 2400 
CTGGAAGCCT GTGAATTCCT CAGGAAGTAA AACCGAAGAA GATGGCCCAG CTCCCCAAGA 2460 
AAGCCTCAGC CATTCACTGC CCCCAGCTCT TCTCCCCAGG TGTGTTGGGG CCTTGGCTCC 2520 
CCTGCTGAAG GTGGGGATTG CCCATCCATC TGCTTACAAT TCCCTGCTGT CGTCTTAGCA 2580 
AGAAGTAAAA TGAGAAATTT TGTTGATATT CAAAAAAAA 2619 

SEQ ID NO: 15 
SEQUENCE LENGTH: 892 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULAR TYPE: cDNA to mRNA
SEQUENCE DESCRIPTION: 

TCTTGACCGG CACACACAGC TCGCTTCTTC ACTTTCTTTT CCATCCACTG CCGGACCCAA 60 
GCCAGCCTTC CAGGGAGCAG CCATGCCTTA CCTCTACCGG GCCCCAGGGC CTCAGGCACA 120 
CCCGGTTCCC AAGGACGCCC GGATCACCCA CTCCTCAGGC CAGARCTTTG ARCAAATGAA 180 
GCAGGARTGC CTGCAGARAR GCACCCTGTT TGAGGATGCA GACTTCCCAG CCAGCAATTC 240 
CTCCCTGTTC TACAGTGAGA GGCCGCAGAT CCCCTTTGTG TGGAAACGAC CARGGGAAAT 300 
CGTGAAAAAC CCARAATTCA TTCTTGGAGG GGCCACCAGG ACTGATATCT GCCAGGGAGA 360 
GCTGGGAGAC TGCTGGCTAT TAGCCGCCAT CGCCTCCCTT ACGCTTAATC AAAAAGCACT 420 
GGCCAGAGTC ATCCCCCAGG ACCAAAGCTT TGGCCCTGGT TATGCCGGGA TATTCCATTT 480 
CCAGTTCTGG CAGCACAGTG AGTGGCTGGA CGTGGTGATC GATGACCGCC TGCCCACCTT 540 
CAGGGACCGC TTGGTTTTCC TCCACTCTGC CGACCACAAC GARTTCTGGA RCGCCTTGCT 600 
GGAAAAAGCC TACGCCAAGC TAAATGGGAG CTATGAAGCT CTGAAGGGAG GCAGCGCCAT 660 
CGAGGCCATG GAAGACTTCA CTGGGGGTGT GGCAGAGACC TTCCAAACTA AAGAGGCCCC 720 
CGAGAACTTC TATGAGATTC TAGAGAAGGC TTTGAAGANA NGCTCCCTGC TGGGCTGCTT 780
CATTGATACC AGAAGTGCTG CAGAATCTGA GGCCCGGACG CCGTTTGGTC TTATTAAGGG 840
TCATGCCTAC AGTGTAACGG GAATTGACCA GGTAAGCTTC CGAGGCCAGA GA 892

SEQ ID NO: 16 

SEQUENCE LENGTH: 508 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double

TOPOLOGY: linear

MOLECULAR TYPE: cDNA to mRNA

SEQUENCE DESCRIPTION:

TGGAGAATGC GAGCCGGGTG TTCCAGGCTC TCAGTACAAA GAACANGGAG TTCATTCATN 60 
TCAATATAAA NGAGTTCATC CATTNGACAA TGAACATCTG AGGCTGCNTT GTAGAGATGC 120 

^ITrr GNTGAATCTG CCTNGACCTT <*™»™* CTTGGTGTGG 180 

AACCATTACG CCCAGGGTTC ACTCCCCTCT CATCGTCCGG CCTTCTCCCT TCATCTTGAT 240 
CTGGGAAGAA TGAAATGAAC TCAGCTACAC TCTCTGATTT TGTGCTACTC CTTTGTAAAG 300 

IZlZTrr AAGGGGGCTG ATGGCGCCAC CTGTGCCTTA CATCCAGGTT CAGGCATCAC 3 0 

C~ A C ACACTCTACT TTCCTTATTT CCTTCCATTA AGAATTACTC AGAG ™=™ 420 
CCtG ' mCC ATGTA ^TCC AGTCATTGTG ATCAGACATC CTTTATAAAA 480 
CATGTTTTTA TAAATGTGTA TGTGGAAT 508



SEQ ID NO: 17 
SEQUENCE LENGTH: 194 
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: peptide
SEQUENCE DESCRIPTION:

Ser Val His Cys Phe Arg Glu Asp Lys Met Lys Phe Thr Ile Val
1 5 10 15

Phe Ala Gly Leu Leu Gly Val Phe Leu Ala Pro Ala Leu Ala Asn
20 25 30

Tyr Asn Ile Asn Val Asn Asp Asp Asn Asn Asn Ala Gly Ser Gly
35 40 45

Gln Gln Ser Val Ser Val Asn Asn Glu His Asn Val Ala Asn Val
50 55 60

Asp Asn Asn Asn Gly Trp Asp Ser Trp Asn Ser Ile Trp Asp Tyr
65 70 75

Gly Asn Gly Phe Ala Ala Thr Arg Leu Phe Gln Lys Lys Thr Cys
80 85 90

Ile Val His Lys Met Asn Lys Glu Val Met Pro Ser Ile Gln Ser
95 100 105

Leu Asp Ala Leu Val Lys Glu Lys Lys Leu Gln Gly Lys Gly Pro
110 115 120

Gly Gly Pro Pro Pro Lys Gly Leu Met Tyr Ser Val Asn Pro Asn
125 130 135

Lys Val Asp Asp Leu Ser Lys Phe Gly Lys Asn Ile Ala Asn Met
140 145 150

Cys Arg Gly Ile Pro Thr Tyr Met Ala Glu Glu Met Gln Glu Ala
155 160 165

Ser Leu Phe Phe Tyr Ser Gly Thr Cys Tyr Thr Thr Ser Val Leu
170 175 180

Trp Ile Val Asp Ile Ser Phe Cys Gly Asp Thr Val Glu Asn
185 190

SEQ ID NO: 18 
SEQUENCE LENGTH: 51 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION:

Met Val Asp Asp Lys Arg Lys Ser Ala Leu Trp Lys Glu Arg Thr
1 5 10 15

Val Ser Thr Arg Val Lys Ser Met Asn Ala Ser Ile Glu Arg Thr
20 25 30

Arg Gly Asn Ile Pro Ser Thr Gly Leu His Thr Cys Ile Tyr Ile
35 40 45

Leu Glu Asn Thr Ala Met
50

SEQ ID NO: 19
SEQUENCE LENGTH: 63 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Gly Gln Gln His Gln Tyr Tyr Leu Gly Ala Tyr Arg Asn Ala
1 5 10 15

Glu Phe Gln Ala His Cys Arg Ser Thr Glu Ser Lys Ser Ser Phe
20 25 30

Ser Lys Ile Ser Gln Thr Ile Ser Thr Gly Leu His Pro Phe Tyr
35 40 45

Pro Ser Leu Ala Ile Arg Asp Val Arg Ser Glu Gly Phe Lys Arg
50 55 60

Ser Pro Glu



SEQ ID NO: 20 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION: 

TCTTTGCTGG ACTTCTTGGA 20 

SEQ ID NO: 21 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION: 

CTTTGTTTGG GTTGACTGAG 20

SEQ ID NO: 22
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

CACCCTCATT ACATCATCAG 20

SEQ ID NO: 23
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

ATTCCTTGTG TCTTCTGGTA 20

SEQ ID NO: 24
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

CAGTCCTACT TCTCCTATCT C 21

SEQ ID NO: 25
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

ATCATAGCTC AGACCATACC T 21

SEQ ID NO: 26
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

GATCCTGCAG GACTACAAAT C 21

SEQ ID NO: 27
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

GCCTATATAG AAAAATGAAG 20

SEQ ID NO: 28
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

CACCTAGTGA CCGTTCCAGA T 21

SEQ ID NO: 29
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TTCATCTCCT TGGGTGTTAT T 21

SEQ ID NO: 30
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

CTCAGACGCT CAGGAAATAG A 21

SEQ ID NO: 31
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

AATGGGGGAA GTATGTAGGA G 21

SEQ ID NO: 32
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TTACGGATCA TTTCTCTACT C 21

SEQ ID NO: 33
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

AGGGCAAGAT GAAGTGAAAG G 21

SEQ ID NO: 34
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TCCGGAAAGA AGAGCGAGAG A 21

SEQ ID NO: 35
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TGAAACACAA CTACCCCAAT G 21

SEQ ID NO: 36
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

ATAGCAAAGG TAAACTCTCA 20

SEQ ID NO: 37
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TCAATCAGTA GTTCCCAGTA 20

SEQ ID NO: 38
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TTAACAGCCC AATATCTACA 20

SEQ ID NO: 39
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

GAACAAGTGA TTATGCTACC 20

SEQ ID NO: 40
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

AGAATAAGCA ACTTGGAAAA 20

SEQ ID NO: 41
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION: 

TGAATCTGAT GACTATGTGC 20 

SEQ ID NO: 42
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TCCTGGATAC CTTTTGGACC 20

SEQ ID NO: 43
SEQUENCE LENGTH: 19
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

CATCAGGGCT ACAAGGAAA 19

SEQ ID NO: 44
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

CAGATCTACC GAATCAAAAT C 21

SEQ ID NO: 45
SEQUENCE LENGTH: 21
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

ACCAGAATTA GGAATAAGGA T 21

SEQ ID NO: 46
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

GACTCCATGG CAAAACAACA 20

SEQ ID NO: 47
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

TCTTCTTCGG TTTTACTTCC 20

SEQ ID NO: 48
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

AGGCACCAGG GCGTGATGGT 20

SEQ ID NO: 49
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

GGTCTCAAAC ATGATCTGGG 20

SEQ ID NO: 50
SEQUENCE LENGTH: 10
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

CTTGATTGCC 10

SEQ ID NO: 51
SEQUENCE LENGTH: 10
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

AGGTGACCGT 10

SEQ ID NO: 52
SEQUENCE LENGTH: 10
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid synthetic DNA
SEQUENCE DESCRIPTION:

GTTGCGATCC 10

SEQ ID NO: 53
SEQUENCE LENGTH: 10
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid
SEQUENCE DESCRIPTION:
CTGATCCATG

SEQ ID NO: 54 
SEQUENCE LENGTH: 10 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
SEQUENCE DESCRIPTION: 
CTGCTTGATG 

SEQ ID NO: 55 
SEQUENCE LENGTH: 10 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
SEQUENCE DESCRIPTION: 
GATCTGACTG 

SEQ ID NO: 56
SEQUENCE LENGTH: 13
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
SEQUENCE DESCRIPTION: 
TTTTTTTTTT TAA 

SEQ ID NO: 57 
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
SEQUENCE DESCRIPTION: 
TTTTTTTTTT TAC 

SEQ ID NO: 58 
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
SEQUENCE DESCRIPTION: 
TTTTTTTTTT TAG 

SEQ ID NO: 59 
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
SEQUENCE DESCRIPTION: 
TTTTTTTTTT TCA

SEQ ID NO: 60
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION: 

TTTTTTTTTT TCC 13 

SEQ ID NO: 61 
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION: 

TTTTTTTTTT TCG 13 

SEQ ID NO: 62 
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION: 

TTTTTTTTTT TGA 13 

SEQ ID NO: 63 
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION:

TTTTTTTTTT TGC 13

SEQ ID NO: 64 
SEQUENCE LENGTH: 13 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid synthetic DNA 
SEQUENCE DESCRIPTION:

TTTTTTTTTT TGG 13

SEQ ID NO: 65 
SEQUENCE LENGTH: 264 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid 
SEQUENCE DESCRIPTION:

CAGAGCAAGA GAGGCATCCT CACCCTGAAG TACCCCATCG hGc&crc*r^ CMCCAGGCC 60
TGGGACGACA TGGAGAAAAT CTGGCAHr ,r l^Z AGCACGGCAT CGTCACCAAC 120
GAGGAGCACC CCGTGC^T G^A^t r T TOiCA TGTGGCTCCC 180
ATGACCCAGA TCA^GA G^T

Claims

1. [...] nucleotide sequence as shown in any one of SEQ ID NOs: 1 to 16 in Sequence Listing and a DNA [...]

5. [...] characterized in that [...] expression level of mRNA is [...]
6. The detection method according to claim 2, characterized in that the change in an expression level of mRNA is detected by RNase protection assay.



7. The detection method according to claim 1, characterized in that the change in an expression level of a cancer-associated gene is determined by a change in an expression level of a protein encoded by said gene.

8. [...]

10. A kit for detecting a cancer cell by the method of claim 5 or 6, wherein the kit comprises a probe capable of hybridizing with mRNA of which change in an expression level is to be determined.

11. A kit for detecting a cancer cell by the method of claim 8, wherein the kit comprises an antibody recognizing a protein of which change in an expression level is to be determined.

12. [...] proliferation of a cancer cell using a substance specifically binding to a gene or an expression product of said gene, characterized in that cDNA of the gene is at least one of DNAs selected from a DNA having a nucleotide sequence as shown in any one of SEQ ID NOs: 1 to 16 in Sequence Listing and a DNA capable of hybridizing with a nucleic acid consisting of any one of these nucleotide sequences under stringent conditions, wherein the DNA is usable for detection of a cancer cell by a change in an expression level.

13. A peptide usable for detection of a cancer cell, characterized in that the peptide is shown in an amino acid 

14. A peptide usable for detection of a cancer cell, characterized in that the peptide has an amino acid sequence com[...] acid residues in the amino acid sequence as shown in SEQ ID NOs: 17 to 19 in Sequence Listing.

15. A nucleic acid encoding the peptide of claim 13 or 14.

16. The nucleic acid according to claim 15, characterized in that the nucleic acid has a nucleotide sequence comprising [...] sequences of SEQ ID NOs: 1, 2, 9, [...], 13, 15, or 16 in Sequence Listing.

17. A nucleic acid usable for detection of a cancer cell by a change in an expression thereof, wherein the nucleic acid is capable of hybridizing with the nucleic acid of claim 15 or 16 under stringent conditions.

18. An antibody usable for detection of a cancer cell, wherein the antibody recognizes the peptide of claim 13 or 14.